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‘This course of mathematical analysis, 

which can be recommended fo lirst- - 
year University students, includes. rhe 
theory of real numbers, the theory of 
limits, differential and integral calculus 
of functions in one variable, the theo- 
ry of number series, differential cal- 
culus of functions in several variables, 
the theory of implicit functions, and 
geometrical applications of the anal- 
ysis. 

A peculiarity of this textbook is the 
presence of an introductory propee- 
deutic chapter, in which the class of 
the basic notions and ideas consti- 
juting the course of mathematical anal- 
ysis is elucidated in the process of 
considering a simple problem of @ 
particle moving along a line. Another 
peculiarity of the book is the exposi- 
fion, on the basis of the apparatus of 
mathematical analysis, of algorithms 
for computing the values of elemen- 
tary functions and definite integrals 
and algorithms for calculating extrema 

based on the gradient methods. The 
exposition of the theory of the en- 
yvelope and the discriminant curve is 
ymore complete here than in other 
study aids. The Appendix is devoted 
fo the axiomatic theory of real num- 
bers and to the proof of the theorem 
ef isomorphism and its various realiza- 

tions. 

The textbook was awarded the 1980 
State Prize of the USSR, 
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PREFACE 


This book is based on the lectures read by authors at Moscow 
State University for a number of years. 

As in Part 1, the authors strived to make presentation systematic 
and to set off the most important notions and theorems, 

Besides the basic curriculum material, this book contains some 
additional questions that play au important part in various branches 
of modern mathematics and physics (the theory of measure and 
Lebesgue integrals, the theory of Hilbert spaces and of self-adjoint 
linear operators in these spaces, questions of regularization of Fourier 
series, the theory of differential forms in Euclidean spaces, etc.). 
Some of the topics, such as the conditions for termwise differentiation 
and termwise integration of functional sequences and functional 
series, the theorem on the change of variables in a multiple integral, 
Green’s and Stokes’s formulas, necessary conditions for a bounded 
function-to be integrable in the sense of Riemann and in the sense of 
Lebesgue, are treated more generally and under weaker assumptions 
than usual. 

As in Part 1, we discuss in this book some questions related to 
computational mathematics, including first of all approximate cale 
culation of multiple integrals in the supplement to Chapter 2 and 
calculation of the values of functions from the approximate values 
of Fourier coefficients (A.N. Tichonoff’s regularization method) 
in the Appendix. 

The material of this book, together with that of Part 4 published 
earlier, constitutes an entire university course in mathematical] 
analysis. 

Note that throughout this text Part 1 is referred to as Volume 4 
and designated [1]. It should also be stressed that when reading this 
book Chapter 8, The Lebesgue Integral and Measure, Chapter 11, 
Hilbert Space, and ajl the supplements may be skipped without 
jmpairing the understanding of the rest of the text. 
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The anthors feel deeply indebted to A.N. Tichonofi and A.G.'Svesh- 
nikoy for much valuable advice and numerous profound criticisms, 
to Sh.A. Alimov, who has dono more than just editing this book, 
to L.D. Kudryavtsev and S.A. Lomov fora great number of valuable 
criticisms, to P.S. Modenov and Ya.M. Zhiloikin, who have made 
available to the authors materials on field theory and approximate 


methods of evaluating multiple integrals. 
V. flying E. Posnyak 
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CHAPTER 1 


FUNCTIONAL SEQUENCES 
AND FUNCTIONAL SERIES 


In this chapter we shal] study sequences and series whose members 
are not numbers but functions defined on some given set. Such 
sequences and series are widely used to represent the functions and 
to compute them approximately. 


1.4. UNIFORM CONVERGENCE 


1.1.1. The functional sequence and the functional series. Let {x} 
be some set*. Then, if we assign to each n of the natural numbersi,2,... 
20, 7,... by a@ definite rule some function f, (x) defined on {zx}, the 
set of the numbered functions f, (z), fo (x), - - «> fn (Z), -- - is said to 
be a functional sequence. 

The individual functions f, (z) are called members or elements of 
the sequence, and {z} is its domain of definition or simply domain. 

The symbol {f, (z)} will be used to designate a functional sequence. 

The formally written sum 


a Un (x) = Uy (x) + Uz (Z)+... Uy (z) +... (1.1) 
of an infinite number of elements of a functional sequence {u,, (z)} 
will be called a functional series. 

The terms u, (x) of that series are functions defined on some set {z}. 

The set {z} is called the domain of definition, or domain, of the 
functional series (4.1). 

As in the case of the number series, the sum of the first 7 terms of 
(1.4) are called the nth partial sum of that series. 

It should be stressed that the study of functional series is perfectly 
equivalent to the study of functional sequences, for to every functional 
series (1.1) uniquely corresponds a functional sequence 


Si (x), So (z)y o00y Sq (By oe (1.2) 


* In particular, by {z} we may imply both the set of points of a straight 
line and the set of points z = (z,, Zo, . - -; Zm) of a Euclidean space E™. 
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of its partial sums, and, conversely, to every functional sequence 
(1.2) uniquely corresponds a functional series (1.1) with terms 

Wy (xt) = Sy (x), Up (Z) = Sp (z) — Sy-y (x) for ne 2 
for which the sequence (4.2) is a sequence of partial sums. 

Were are examples of functional sequences and scries. 


¥,(x)} £(2) | f(x) 
i | 1 
0} ) x 0 1/2 1x Ot fn 1 x 
Fig. 4.4 


Example 1. Consider a sequence of functions {f, (z)} each defined 
on the closed interval 0 < x <1 and having the form 


, ({1—nz) when O<Sz<ci/n, 
In (2), = 0) 


Figure 1.4 gives the graphs of the functions jf, (z), f. (z) and fp (2)- 
Example 2. As an example of a functional series consider the fol- 
lowing power series in f: 


ia G 


(1.3) 


when I/n<a<l. 


* 


- ry ? n 
14S Seattet+ at... tate. (1.4) 
lect 

Notice that the (2 + 1)th partial sum of (4.4) differs from the 
Maclaurin expansion of e* only by the remainder term 2,2, (z). 

1.1.2. Convergence of a functional sequence at a point and on a set. 
Suppose a functional sequence (or series) is defined on a set {z} 
Fix an arbitrary point x, of {z} and consider all the elements or 
the sequence (or the terms of the series) at 25. We obtain a number 
sequence (or series), 

If this number sequence (or series) converges, the given functional 
sequence (or series) is said to converge at x9. 

The set of all points 2 at which a given functional sequence (or 
series) converges is called the domain of convergence of that sequence 
(or that series). 

At various particular cases the domain of convergence may cither 
coincide with the domain of definition or form a part of the domain 
or be an empty sel. 
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Corresponding examples are given below. 

Suppose that {f, (z)} has {z} as the domain of convergence. The 
collection of all limits taken for all the values of x from the set {z} 
forms a well-defined function f (z) also given on {z}. 

This function is called the limit function of the sequence {f, (z)}. 

Quite similarly, if the functional series (1.1) converges on some 
set {z}, then on that set a function S (z) f 
is defined which is the limit function of (2) 
the sequence of its partial sums and is { 
called the sum of that series. 

The sequence (4.3) of Example f above 
converges on the whole of the closed interval 
O<r<i. 

Indeed, f, (0) = 1 for all integers n, 

i.e. at the point xz=0 the sequence (1.3) 
converges to unity. 0 | 2 

But if we choose any zin the half-open 
interval 0 < 2< 1, then all f, (x) begin- Fig. 1.2 
ning with some integer (dependent of 
course on x) will be zero. At any point z of O<( z <1 therefore 
(4.3) converges to zero. 

So, (1.3) converges on the entire interval 0 < zt <1 to a limit 
function f(x) having the form 


1 when r=0 
f(e)=| O when 0O<izr<cl. 


The graph of that limit function is given In Fig. 4.2. 

We stress that f (x) is not continuous on the interval O<2<cl 
(it is discontinuous at z = Q). 

Now we turn to the functional series (1.4) of Example 2. 

That series converges at any point x of an infinite straight line 
and its sum equals e*. For the proof the reader is referred to Chap- 
ter 13 of [4] (see Example 3 in Section 13.1.1)*. 

1.1.3. Uniform convergence on a Set. Suppose that a sequence 


hh (x), he (x), e 8 sy th (zx), eee (1.5) 


converges on a set {z} to a limit function f (z). 

Definition 1. The sequence (1.5) is said to converge to the function 
f (z) uniformly on the set {x} if for any © > 0 we can find an integer 
N (e) such that given n > N (e), for all x of {x} we have** 


fn (t) —f(x)l[<e. (1.6) 


* This proof, however, immediately follows from the Maclaurin formula 

for e~ and from the fact that the remainder in that formula tends to zero for all x. 

** If by {z} we mean a set of points z = (21, . - ., m) of a space E™, then 

we obtain a definition of uniform convergence of a sequence f/f, (z) = 
= fy (24, Loy - » +} Lm) Of functions of m variables. 
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Remark 1. What is very essential to this definition is that .V 
depends only on ce and is independent of z. Thus for any ¢ > 0 we 
can find a universal integer N (e) beginning with which inequality 
(1.6) holds at once for all x of {z}. 

Remark 2. The convergence of {f,, (x)} on {x} does not at all imply 
ets uniform convergence on {zx}. Thus the sequence (1.3) of Example 1 
abave converges on the entire interval [0, 1] (that was established 
above). 

We now prove that that sequence does not uniformly converge on 
{, 4J. Consider a sequence of points z, = 1/2n (n = 1, 2, ...) 
belonging to [0, 4). At each of the points (i.c. for every 7) the rela- 
tions f,, (7,) = 41/2, f (z,) = O hold. Thus for any n 


ifn (7,) — f (Z,) [= 1/2, 


ic. when © << 4/2, inequality (1.6) cannot hold for all points x of 
{(Q, 1) at once, whatever n we have. 

Remark 3. Note that the uniform convergence of {f, (x)} on {2} 
to f (z) is equivalent to the convergence of a number sequence {e,} 
whose elements €, are the suprema of the function | /, (rz) — f(z)! 
on {zy}. 

Remark 4. It is immediate from Definition 1 that if {f, (7)} uni- 
formly converges to f (x) on the whole of {z}, then it docs so on any 
portion of fr}, 

Now we give an example of a functional sequence uniformly con- 
verging on some sel {7}. Consider again the sequence (1.3), this time 
nat on the whole of (0, 1], but on a closed interval (6, 1], where 6 is 
a fixed number in the interval 0 <6 <4. For anv such 68 we can 
find an integer beginning with which all the clements f, (7) are zero 
on {6, 4]. Since the limit function f (z) is also zera on [4, 1], every- 
where on [6, 4] we have | f, (2) — f (x) | < ¢foranyec > 9 beginning 
with the indicated integer. This proves the uniform convercence of 
(1.3) on 8, 4). 

Definition 2. A functional series is said to be uniformly convergent 
on a set {x} to its sum S (zx) if the sequence {S,, (2)} of tts partial sums 
converges uniformly on {x} to the limit function S (2). 

Prove on your own that the functional series (1.4) of Example 2 
above converges to its sum e° uniformly on every interval —r <r < 
<2 r, Where ris any fixed positive number’. 


* Jo prove this it suffices to evaluate the remainder 2,4) (z) in the Maclourin 
formula for e®. This remainder, which is the difference between ¢* and the 
(n> IMh partial sum of (1.4), satisfies for all x in the interval --r 7 or -7 rat 
ance the inequality 


ret 


eicee CS ay er 


see formula (8.62) in lj. 
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{.1.4. Cauchy criterion. The following two main theorems hold. 

Theorem 1.1. Fora functional sequence {f, (x)} to converge uniformly 

on a set {x} to some limit function it is necessary and sufficient that for 
any € > 0 we should be able to find N (&) such that 


| fn+p (z) — fr (x) <e (1.7) 


for alln>WN (e), all natural p (p = 1, 2,...) and all x of {zx}. 
Theorem 1.2. For a functional series 


2, Up, (2) (1.8) 


to converge uniformly on a set {x} to a certain sum it is necessary and 
sufficient that for any & > 0 we should be able to find N (€) such that 
n+p 

| un (x)|<e (1.9) 

h=o=n+if 
for alln > WN (e), all natural p and all x of {z}. 

Theorem 4.2 is a consequence of Theorem 1.4: it is sufficient to 
notice that under the modulus sign on the left of (1.9) we have the 
difference S,4, (x) — S,, (x) of the partial sums of (4.8). 

Proof of Theorem 1.1. (1) Necessity. Let {f, (x)} converge uniformly 
on {z} to some limit function f (x). Choose an arbitrary ¢ > 0. For 
the positive number e/2 we can find N such that for all x > WN and 
for all x of {z} 


ifn (t) — f (@) | < 2/2. (4.10) 


If p is any natural number, then for xn > N andifor all x of {z} 
all the more so 


l fnep (2) — f (x) |< 2/2 (1.44) 


Since the modulus of a sum is not greater than the sum of moduli, by 
(4.10) and (4.14) wo get 


| fntp ()— Sn (t) | =| [Pn p (2) — 7 (2) +(e) — fn IS 

<1 fntp (@)— f(z) [+1 fF (2) — fn (2) <e 
for all 2 > WN, all natural p and all x of {z}). Necessity is proved. 

(2) Sufficiency. Inequality (1.7), together with the Cauchy criterion 
for the number sequence, implies the convergence of {f, (x)} for any 
given « of {z} and the existence of a limit function f (z). 

Since (4.7) holds for any natural p, by proceeding in this inequality 


to the limit as p — oo (see Theorem 3.13 in [1}) we find that/for all 
n>WN and for all x of {x} 


Lf (2) — fh (z) p<. 
By the arbitrariness of « > 0, sufficiency is proved. 
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1.1.5. Sufficient tests for uniform convergence. Depending on 
convenience we shall formulate the tests for uniform convergence 
either in terms of sequences or in terms of series*. 

We introduce a new concept to formulate the first test. 

Definition. A sequence of functions {f, (z)} is said to he uniformly 
bounded ona set {7}, if there is a real mumber -\ such that for all x of 
{x} and for all n we have | fy, (7) | A. 

Theorem £.3 (Abel-Dirichlet test). Let{ 


‘ 


NY aey (x) ee, (ee) 


bea junctional series. It converges uniformly ona set {z), if the following 
fico conditions hold: 
(i) the sequence {v, (x)} is noninereasing on {x} and untformly 
converges on fx} to sero; 
xu 


oe % e ® 
(ii) the series \’ uy (7) has a sequence of partial sums uniformly 
es 


bounded on {7}. 

The proof coincides almost textually with that of the corresponding 
(est for the canvergence of number series (see Section 13.5.2 in {1}). 
The render should carry it out on his own. 

Example 1. As an illustration, we discuss uniform convergence 
of the series 


~* 


s ad (1.12) 
hs " 


Since {1/c} (for all 7) is not increasing and tends uniformly to zero, 
by the Abel-Dirichlet test, (4.12) converges uniformly on any set on 
which 
NS sin kr (1.123) 
Kf 
has a uniformly bounded sequence of partial sums. We compute and 
evaluate the mth partial sum S, (x) of (1.13). 
Summing the identity 
i 


° a e , qv 
Jsin —- sin Iex =~ cos ( — > ) r—cos G a xz) r 


z 
over all & from { to nr, we get 


- &« J ; 5 
2sin=-4,, (z) cz COS ~~ — COS (n see =). 


- on 
need 


* By what was said in Scction 1.4.4 both formulations are equivalent. 
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From this 
cos cos (n+ 5) x 
Sy (2) a re 
2 sin 5 
Therefore for all n 
| Sn (2) IX4—sy. (1.44) 
six 5 


From inequality (1.14) it is evident that the sequence {S, (z)} 
of partial sums of (1.13) is uniformly bounded on any fixed in- 
terval containing no points z,=2am(m=0, +1, +2, ...), for 
on any such interval |sin S| 

Thus we have proved that (1.12) converges uniformly on any inter- 
val containing no points z, = 2am, where m = 0, +4, +2,.... 

Theorem 1.4. (Weierstrass test). [f a functional series 


has a positive infimum. 


S 
k= 


uy, (x) (1.45) 


is defined on a set {x} and if there is a convergent number series’ 
foo) 


>) Cy such that for all x of a set {x} and any integer k 


— 
—_— 


[ tp, (x)|<cy, (1.16) 


then (41.15) converges uniformly on {zx}. 

Concise statement: a functional series converges uniformly on a 
given set if it can be majorized on that set by a convergent number 
series. ~ 

Proof. By the Cauchy criterion for the number series cp, 

k= 
for any e>0O there is N(e) such that for all nN (e) and for 
any natural p 


ntp 


Za Ch. (1.17) 


From (1.16) and (1.17) and from the fact that the modulus of 
a sum is not greater than the sum of moduli we get 


n+p 


2X ua (2) 


=n-+ 


(for all n2WN (e), all natural p and all a of {z}). 


<E 
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t3 
tr 


By the Cauchy criterion, (1.15) converges “uniformly on {z}, 
Thus the theorem is proved. 
Example 2. The series 


x 
Qe) ibe 
“\) Sill ow . . 
oe where 6> 0, 


het 
converges uniformly on the whole of an infinilo straight line, for on 
the entire straight line 


f sinks [4 
| ee a pind * 
nm » 
ae 
and the number series >’ FAts » with 6 > Q, converges (see Chap- 
kew{ 


ter 13 in [4]). 

Remark 1. The Weierstrass test is not necessary. 

Indeed, it was established above that (1.12) converges uniformly 
on any inferyal containing no poinis rz, = 21m Oy =D, 4-4, 3 2,.. 
...) In particular, (1.12) converges uniformly on [x/2, 37/2). On 
this interval, however, the absolute value of the &th term of (1.12), 


len has a supremum equal to 1/A, i. c. the majorant » 4 is 


h=! 
known to be a divergent harmonic series. 

Theorem 1.5 (Dini test*). Let {fy (x)} be a sequence nol decreasing 
(or not increasing) at each point of a close interval [a, b] and converging 
on that interval to a timit function f (xz). Then, if all the elements of the 
seguence f,, (x) and the limit function f (x) are continuous on {a, b], the 
convergence of {f,, (x)} is uniform on {a, 0). 

Proof. For definiteness suppose that {f, (z)} is not decreasing on 
[a, b] (the case of nonincreasing sequence can bo reduced to this case 
by multiplying all the elements of the sequence by —4). 

Put 


a (x) — f(z) ~— fn (x). 


The sequence fr, (z)} has the following properties: 

(1) all r, (z) are nonnegative and continnous on [a, b); 

(2) fr, (z)} is nonincreasing on [a, 6); 

(3) at each point z of fa, b] there is a limit lim r, (x) = 0. 


Tie mw 
We want to prove that {r,, (z)} converges to zero uniformly on 
(a, J. It suffices to prove that given any © > Q, thore is at leas! one n 
such that r, (7) < © at once for all x of (a, b} (then by property (2) 
above r, (x) << e for all subsequent integers too). 


eee 


* Ulisse Dink (1875-1048) ts an Italian mathematician. 
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Suppose given some & > O wecan find no nsuch thatr, (x) << « at 
once for all x in {a, 6]. Then for any n there is a point z, in {a, 8} 
such that 


Tn (Zp) & €. (41.18) 


By the Bolzano-Weierstrass theorem, we may choose a subsequence 
{z,,} of {c,} converging to some point zy in [a, b] (see Section 3.4 
in {1]). 


All r,, (x) (given any m) are continuous at zy. Therefore for any m 


tim im (Tn,)=Tm (29). (1.19) 


On the ether hand, on choesing for any given m some n; greater than 
m we obtain (by property (2) above) 


Tm (Tn,) 22 Tn, (Tn,)- 
Comparing the last inequality with (1.18) we have 
Pm (Zn, )SzE (41.20) 


(for any given m and any 7, greater than m). Finally, comparing 
(1.49) and (4.20) yields ° 


Ym (Zp) 2 & 


(for any m). 

The last inequality contradicts the convergence of {r, (x)} to zero 
at x). This contradiction proves the theorem. 

Remark 2. Essential in the Dini theorem is the condition of mono- 
tonicity of {f, (x)} on [a, b], for a sequence nonmonotonic on [a, b] 
of functions continuous on [a, b] may converge at each point of [a, b] 
to a function f (x) continuous on [a, b], but without converging to 
it uniformly on [a, JJ. 

A sequence of functions f, (z) equal to sinnz for OS rt<ian/n 
and to zero fora/n <Cz2<n(n =1,2,...) may serve as an exam- 
ple. It converges to f (x) = 0 at each point of [0, 1] but is not uni- 
formly convergent on [0, x], for | fn (tn) —f (tn) |= 1, given 
z, == 0/2n for all 7. 

Remark 3. We formulate the Dini theorem in terms of series: 
if all terms of a series are continuous and nonnegative on a closed interval 
{a, b} and the sum of the series is also continuous on [a, b], then that 
series uniformly converges to its sum on {a, 6). 

Remark 4. The Dini theorem and its proof remain valid if we take any bound- 


ed closed set {x} instead of [a, b} in the theorem. It is customary to call such a 
set a compact sect. 


Example 3. The sequence {z"} uniformly converges to zero on 


a closed interval [ 0, z|- 


t= 
rey 
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. { 
Indeed, (1) for any z in | 0, = | the sequence converges to zern; 


n 


(2) all the functions 2° 
GUS ON | 0, 5 |i (3) {2"} is not increasing on | 0. =|. 


Thus all the hypotheses of the Dini theorem are satisfied. 

1.1.6. Proceeding to the limit term by term. Continuity of the sum 
of a series and of the limit function of a sequence. Consider a point @ 
on an infinite straight line. Let {z} be a set that may not contain a 
but such that there are points of the set {7} in any e-neighbourhood 
of a”. 

The following statement holds. 

Theorem 1.6. Leta functional series 


and the Jimit function zero are contin- 


NY oy, (2) (1.15) 
Kez} 


converge uniformly ona set {x} toa sum S (x). Further let all terms of 
that series have alta point aa limiting value 


lim uy, (z) == bp. 
Reon 
Then the function S {x} also has a limiting value at a, with 
lim S(z)-= “ lima, (xr)= \ by, (1.21) 
Aceon hz fusca ae | 
i.e. the limit sign lim and the summation sign “’ may be interchanged 
(or, as we say, we can proceed to the limit (orm hy term). 
Proof. We first prove that the number series “ }, converges. 
k=} 
By the Cauchy criterion applied to (1.15), given any € > 0, we can 
find N (e) such that 
| yey (Z) HE Upge (7) +... t+ Unty I <e (1.22) 
for all n >= N (e), all natural p and all z of {x}. 
Proceeding in (1.22) to the limit 2 —-a** we get 
| by ot Dye -+ oee™ On ap — £ <= Qe 
(for al) 7 > N (e) and all natural p). 


The Canchy criterion holds for \ 4, therefore. and the series 
he~] 
canVeErECs. 


“tn other words, ars the limit point of {7}. 
2 te do this we can use some sequence of points {7,5 converpim: to a. 


Ca. I. Functional Sequences cond Functions! Series 


CS Yad 
= 


~ 
° . PS ‘\ . 
Now we evaluate the difference S (x) — \’ , for all values of x iv 
Kt 
we 
e e » ‘a e 
a smnall neighbourhood of a. Since S (7) = ™’ uy (x) for all points of 


Ruaont 
— 


{zr}, given any x we have 


fo) n n ~ nm 
S(Q—- VSI N ua (a-N&IlS No um@- N 2 
y= a REI hen! 


From this we obtain for all x of {7} 


© Th n x 
IS(yan— Na l<] uw, (2) — S b, |---| » uy (2) | + 


; ot b 1.23 
—_ aoa hk e ( wad)? 
=n+) 


~ 
« ° 1 * ~ 
Take e > 0. Since the series »’ b, converges and since (41.15): 
k=1 
converges uniformly on {z}, for given € we can find » such that for 
all x of {vz} 


& & . 
| > b>. I<, | » > (<p. (1.24) 
A=n+] tren { 


Since the limit of a finite sum is equal to the sum of the limits of 


summands, for given e€ > 0 and the chosen r there is § > 0 such 
that 


| » uy, (x) — » bp, <> (1.25) 
i 


k=! ho 


for all x of {xz} satisfying the condition 0 < Jz—a]<é. 


Putting (4.24) and (4.25) in the right-hand sido of (1.23) we finally 
get 


|S(a)— & bn |< 


for all the points z of {2} satisfying the condition 0 << | zt —a i <d. 
This proves that the function S (z) has a limiting value al z =a 
and that equation (4.24) holds. Thus the theorem is proved. 
Now we formulate Theorem 4.6 in terms of functional sequences. 
If a functional sequence {f, (x)} converges uniformly to a limit 
function f (z) on a set {x} and if each element of the sequence has a 
limiting value at a point a, then f (x) also has a limiting value at a, 
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rp gett 


fim f(r) lim (tim f, (z)) = lim (lim f (x)), 


wert Lowe Nem Uo! 


Le. tke sign of the limit of a sequence lim and the sign of the limiting 


Tl 2 OO 
unluc of a function lim may be interchanged (or as we sau we can 
Ven 
proceed to the limit as x a element by element). 

Remark to Theorem 1.6. If under the hypotheses of Theorem 1.6 
we in addition require that @ should belong to {x} and that all terms 
u, (r) of (1.15) should be continuous at a (or respectively continuous 
wf aon the right or the left), then the sum S (x) of (1.15) is also 
continuous at a (or respectively continuous at a on the right or the 
Jeft). 

Indeed, in this case b, = uy, (a) and equation (1.21) becomes 


ao 
lim S (x) = -, uy, (a) = S (a), 
7? io 
which just implies the conlinuity of the function S (x) at a (or, if x 
tends to a from one side, the continuity of S (z) at a respectively on 
the right or on the left). 

Applying the abave remark to cach point of some closed interval 
la, 6) we arrive at the following main theorem. 

Theorem 1.27 .7f all terms of a functional series (all elements of a 
fitnctional sequence) are continuous on a closed interval fa, b] and if 
thai series (that sequence) uniformly converges on [a, b], then the suin 
of ihe series (the limit function of the sequence) is also continuous on 
fa, Oo). 

Remarks to Theorem 1.7. (1) In Theorem 31.7, instead of a closed 
interval [a, 6b] we may take an open interval, a half-open interval, 
n half-line. an infinite straight line or any set {z} dense in itself in 
soneral. (2) Essential in Theorem 1.7 is’the requirement on untform 
convergence, for a nonuniformly convergent sequence of continuous 
functions may converge to a discontinuous function (see Example (1.3) 
in Sections 1.1.4 and 1.1.2). 

Coneluding remark. Al! the theorems of this section are trae for 
sequences of functions given on a set {7) of £". 


1.2. TERM-BY-TERM INTEGRATION AND TERM-BY-TERM 
DIFFERENTIATION OF FUNCTIONAL SEQUENCES 
AND SERIES 


§.2.4. Term-by-ferm integration. The following main theorem holds. 
Theorem 1.8. ffa functional sequence {f, (7)} uniformly converges 
ren limi? function j (r) on a closed interval la, b} and tf everu function 
f, tr) isintegrahle on [a,b], then f(z) is also integrable on fa, 0), it 
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being possible to integrate {f, (x)} over [a, b] element by element, i.e. 
b 
lim \ fn (x) dx 


n~- oo 
"a 


b 
exists and is equal to \ f (x) dx. 


Proof. Choose ¢ >0. Then, by the uniform convergence of {f,, (z)} 
to f (x), ere is N (€) for ¢ > 0 such that for all nm > N (e) and for 
all x in [a, 0 


| fn (2) —f (2) | <-ygay- (1.26) 


If we prove that f (x) is integrable on [a, b], then, using the ordinary 
evaluations of integrals* and inequality (1.26), we get 


| l fn (x) dx— f (x) da| =| (fn (x) —f (x)] dn|< 


o 


b 
<\ lin @)-f@ lde<a ps ldze=4<e 


a 


(for all n> N (e)). 


b 
This proves that the limit lim \ fn (x) dz exists and is equal to 
TM Og 
a 
b 


| f (z) dz, and it remains for us to prove that f (xz) is integrable on 
{a, bd]. 

On dividing [a, b] by means of arbitrary pointsa = 2,<24,<... 
...<2%m = 6b into m closed intervals [z,_,, 7] (k = 1, 2,..., m) 


we agree to denotejhy w,, (f) (respectively by o, (f,)) oscillation on 
the kth closed interval [z,_,, z,] of f (z) (respectively of f, (x))** 


* We mean the following evaluations of integrals established in Section 10.6 
of [1]: (1) if a function F (xz) is integrable yon fa, b], so is { F (x) | with 


b 
| F@) dz} < \|F@ dz; (2) if f(z) and g(x) are both integrable on 
c 


a 
& 


b 
[a, b] and everywhere on it 7 (x) < g (x), then \ f(e)dr< | g (x) dz. 
a e 
** Recall that the oscillation of a function on a given closed interval is the 


difference between the supremum and infimum of the function on the given 
interval. 
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We want to see that given any et > O and any & = 1, 2,.... 7, 
we can find a sufficiently large nm for which 


¢ t m~ 
oy (Ayo, (Fy Toa: (1.27) 


Indeed, whatever x! and 2” of (2,_,. 2], we have 
P(r Fr) ESSE fn (2D LE Df 2 — fn QD P+ 
tf, lr") — Fitz") J. (4.28) 


By the uniform convergence of {f, (7)} to f (x), given any e > 0, 
there is x such that for any x in fa, b] inequality (1.26) holds. Thus 
for this n 

, 


I f(z") — fn (2) | | fn C2") — F(z") | <jr=a 
and therefore, by (4.28), 


LCA L VIS fn 2") — fn 2") | + pte. 

From the Jast inequality and from the arbitrariness of points a’ 
and 2” it immediately follows that inequality (1.27) holds for the 
choses) integer 7. 

Denote now, for the arbitrary subdivision of fa, b] we have assumed, 
the upper and lower sums of f (rv) by S and s and the upper and Jower 
sums of f, (x) by S,, and s,. 

Multiplying inequality (1.27) by the length of the Ath closed in- 
terval Az, and then summing it over all & = 1, 2, ..., m we eet 


S—sisS, —s, 4-€. (1.29) 


Inequality (1.29) is established by us for an arbitrary subdivision 
of fa, J. By the integrability of the function 7, (7) on fa, J] we can 
find a subdivision of fa, b] for which S, — s, <Ce* and therefore, 
by (4.29), S —s< 2e. 

Since ¢ is an arbitrary positive number, the last inequality proves 
that f (7) is integrable on [a, b}**. Thus the theorem is proved. 

Now we formulate Theorem 1.8 in terms of functional series: 

If o functional series (A.15) converges to its sum § (7) uniformly on 
a closed interval {a, bl and if cach term of that series u,, (x) isa function 
integrable on fa, b), then S (7) is also integrable on la, b), it being 


* Rv Theorem (004 of [A]. 

** By Theorem tC. tof (tf, Che existence for an achitrary © >. oof a sub 
Wisasion of a closed iuterval for which S ~— sc 2r is a necessary aid sulfeient 
condition for any function founded on the interval to be integrable. The bounded- 
ness of f fry on fa, &} iminediately follows from (1.26) and from the boundedness 
of (be function f, (2) intevrable on fa, 4). 
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possible to integrate (1.15) over [a, b] term by term, i.e. the series 


co ob 


> \ up (x) dx 
k=l a 
b 
converges and has as its sum \ S (x) dz. 
a 

Remark. In courses of mathematical analysis Theorem 1.8 tends 
to be proved under the more stringent assumption that every function 
f, (x) is not only integrable but is continuous on [a, bJ as well. With 
this additional assumption the above proof becomes simpler, for it 
is sufficient to refer to Theorem 1.7 to prove that the limit function 
f (x) is integrable on [a, 0}. 

{.2.2. Term-by-term differenlialion. Now we prove the following 
main theorem. 

Theorem 1.9. Let every function f, (2) have a derivative f, (x) on a 
closed interval la, b]*, the sequence of derivatives {f, (x)} converging 
uniformly on [a, b] and the sequence {f, (x)} itself converging at least 
at one point xo of [a, b]. Then {f, (x)} uniformly converges to some 
limit function f (x) on the entire interval [a, b], {f, (x)} being differenti- 
able on {a, b] element by element, i.e. everywhere on [a, 6) f (x) has 
the derivative f' (x), which is the limit function of {f,) (x)}. 

Proof. We first prove that {f, (z)} uniformly converges on [a, 5}. 
From the convergence of the number sequence {f, (z»)} and from 
the uniform convergence on [a, b] of {7;, (x)} we deduce that given an 
arbitrary © >> 0 we can find JN (e) such that 


| fnvp (to) — fn (to) <r fare (fale) |< yer (1-30) 


for all n > N (e), all natural p and (this applies to the second of the 
inequalities (4.30)) all a of [a, 0]. 

Let x be an arbitrary point in [a, bJ. Given any fixed n and p, 
all the hypotheses of the Lagrange theorem hold on [z 9, z] for the 
function [f,4p (t) — fr (8)] (see Theorem 8.12 in [1]). By that theorem 
there is a point & between zx and zy such that 


[fn+p (2) —fr (x)] — [fntp (Xo) —fn (Zp)] = 

= (fn+p (§) — fn (§)] (2 — 2%). 
From this last equation and from (4.30), on taking into account the 
fact that |x —2),|<b—a, we get 

lfntp (x) ~~ fa (zx) | <eé 
(for any xin [a, b], any n > N (e) and any natural p). 

* Here and in what follows by “f (z) has a derivative on a closed interval 


{a, b]” we mean the existence of a derivative f’ (x) at any interior point of [a, 5], 
the right-hand derivative /’ {a + 0) at a and the left-hand derivative f’ (6 — 0) 
at b. 
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Put thie just means thet {f, (z)} converges uniformly on [a, v 
fo some Jimit function f (x)*. 

it remains to prove that f(z) fas @ derivative at any point z, in 
fe. &) and that that derivative ic the limit function of {f, (z)}. 

Choose an arlitrary point z, and, correspondingly, a positive 
number 6 such that the é-netghbourhood of z, is contained entircly 
in fa, b) (in case z, is an end point of fa, db], by the 6-neighbourhood 
of x, we shal) mean either the right-hand hal{-neighbourhood 
(a, a -+- 5) of the point @ or respectively the left-hand half-neigk- 
hourhood (b — 6, &) of b). 

Denote by {Az} a set of all numbers Ax gatisfying the condition 
O<tAr {<8 whena< 7, <b, the condition 0 << Az < 6 when 
ZY, z= aor the condition —6 < Az < 0 when z, = 6 and prove that 
the sequence of functions of the argument Az 


Az)— fr. (79) 
Az 


converges uniformly on the set {Az}. 

Given an arbitrary c¢ > 0, by the uniform convergence of {f} (z)} 
there is NV (e) such that 

binep(2)—falz)}<e (4.34) 
for all z in fa, b), all x S N (e) and al] natural p. 

Observing this, choose an arbitrary Az of the set {Az} and apply 
the Lagrange theorem to [f,+, (t) — f, (t)] (with any n and p chosen) 
on {7,, %> (- Az]. By the theorem, there isa number 0 inO <0< f 
such that 


Gr (Az) ~— fa lzg- 


"j nap (\7)— Ts, (\7) = 


__ nap o> Az)— fn (r94- Az)J—[fs an (=e) — In (=,)} 
= AS J 


= fen (7p-+-O Az) — fa (79+ 89 Az). 

From the last equation and from inequality (1.31) valid for all 
the points 7 of [a, b] we get 

lp,4y (Az) — G, (Az) | <e 
for any Ax of {Azv}, any n D> N (ce) and any natural p. Thus 
{c, (Ax)} converges uniformly on {Az} (by the Cauchy criterion). 
But this allows Theorem 1.6 on proceeding to the limit term by term 
(o he applied to {q, (Az)} at Az = O. According to Theorem 1.6°%, 

i (Xo -- Ax) — | (xe) 

srt 


a enenenmnied 


* By the Cauchy criterion. i.e. by Theorem 1.4. 
** The formulation of Thearem 4.Gin terms of functional sequences is used, 
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the limit function {@, (Az)}, has a limiting value as Ar > 0, with 


f (2p Az)—f (Zp) = lim (lim Pn (Az)] = — 
Av~-0 z Axv-0 n-c 


= lim [ lim Gn (Az)) = lim} Jim 


~~ oc noc -§Ax-0 


fn (2g tt AZ)—Jn (Zo) | _. 
Az _ 


= lim tn (Zp) 


Tic 


This proves that the derivative of f (x) at Z> exists and is equal to: 
jim f, (zo) The theorem is proved. 
Ti+ CO 
Now we formulate Theorem 1.9 in terms of functional series. 
Ij every function wu), (“) has a derivative on a closed interval 


eo e os es =) e 
(a, 5} and if the series of derivatives N us, (x) converges uniformly 
h=4 


ao 
on fa, b] and the series u; (x) itself converges at least at one 
k= 


ce 


point of the interval [a, b}, then ue, (%) converges uniformly on 
k= 


the entire interval [a, b] to a@ certain sum S (xz), Unis | series being 
dijierentiable on fa, b} term by term, i.e. its sum S (zx) has on 


= 


(a, b] aderivative which is the sum of the series of derivatives > & Us (x). 


Remark 1. We stress that Theorem 41.9 assumes only that” every 
function f, (x) has a derivative on [a, db]. Neither the}boundedness nor, 
moreover, the integrability or continuity of that derivative is 
required. [t is usual in courses of mathematical analysis to prove 
Theorem 1.9 under the additional hypothesis that every derivative 
fn (xz) on la, b} is continuous. 

Remark 2. If Theorem 1.9 requires in addition that every derivative 
dn (2) should be continuous on [a, b], then by Theorem 1.7 the deriv- 
ative of the limit function f (z) is ‘also continuous on fa, b}. 

Remark 3. For the case of functions of m variables Theorem 
1.9 assumes the following form: if every function f, (z)=f, > 
X (4, .... Lm) has on a bounded set iz} of points in £™ a partial 
Gin 
Or, 
on {z} and the sequence {f,(z)} itself converges at each point of 
{rz}, then {f,(z)} can be differentiated with respect to a variable 
x, on {x} element by element. 

Theorem 1.9 yields the following statement. 

Theorem 1.10. If every function f, (x) has an antiderivative on [a, b} 
and if the sequence {f, (x)} converges uniformly on [a, b] to a limit 
function f (x), then f (x) also has an antiderivative on [a, bJ. \foreover, 


ren: . 
derivative Soe and if the sequence 


converges uniformly 
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ifr, is any paint of la. b), then the sequence of antiderivatives D, (2) 
of ihe functions f,, (x) satisfuinge the condition , (x) = 0 converges 
uniformly on la, b) to the antiderivative MD (x) of j (r) satisfying the 
condition ( (rq) = 0. 

Proof. Tt suffices to notice that for the sequence of antiderivatives 
(Dp, (x) satisfying the condition , (75) = 0 all the hypatheses of 
Thearem 1.9 hold. This ensures that the sequence {, (2)} converges 
uniformly on {a, 0] to the limit function @ (7) which at each point of 
{a, db] has a derivative equal to the limit function f (r) of {f, (x)}. 

Remark 4. We stress that Theorem 4.10 requires neithert he bound- 
edness nor, morcover, the integrability of functions f, (rv) on [a, d}. 

The last three subsections allow us to draw the following important 
conclusion: uniform convergence does not change the class of functions 
having a limiting value (Theorem 1.6), the class of continuous functians 
(Theorem 1.7), the class of integrable functions (Theorem 1.8), the class 
of functions having an antiderivative (Theorem 1.10) or the elass 
(in the case of wniform convergence of derivatives) of differentiable 
functions (Theorem 14.9), 


In conclusion consider an example, based on Theorem 1.9, of a function f (r) 
whee derivative ys’ (z) exists everywhere ona closed interval [O. 1] but is dis- 
continuous at each rational point of the interval, 

Let 


j 
cos — When x: 0, 
0 When z=0, 


so that the function 


. | { 
sin me 2r-cos— when sz = A, 
2° x 


q’ (r)~ 


ner 


Q) when jz -=( 


is discontinuous at z = O andl continuous at all the other points. We number all 
the rational points of [G, 1} to form the sequence x1, ray. es Tye eee (the 


possibility of Uhis was proved in Section 3.4.3 of {1}) and put a, (x) = lt x 


. wo ge i - te ; 
> {xr ~ z,). Then every derivative uj; (z) = ar (ry ~— z,) is discontinuous 


at ane paint z) and continuous at all the ather points. Since for all x of {0, 1) 


{xr—~—sr, }F -- 2 —_ “8 
moiety enetstcate § 


~ ke h 


en 

“Ss 

ow cr? 2) 

e % e C7 

both serics Sou, (x) and \’ uf (x), are majorized by the convergent number 
—f hes 


a 


o { oe e 
series 3 » “= and therefore converge uniformly on [V, 1}. By Theorem 1,9 


hood 
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oO 


the sum f (zx) of S' u, (rz) has on (0, 1] a derivative f’ (x) equal to the sum of 


oo 
> u, (x) and discontinuous at every point z, (k = 1, 2,...). 
k=1 


1.2.3. Convergence in the mean. Suppose every function f, (x) 
(n = 1, 2, ...) and the function f (z) are integrable on a closed 


interval [a, 6]. Then (as is known from Chapter 410 of [1]) so is the 
function 


(fn (2) — f (2)? = fa (2) + P (@) — 2 fn (2)-f (2). 


We introduce the fundamental concept of convergence in the mean. 
Definition 1. A sequence {f, (z)} is said to converge in the mean toa 
function f (x) on a closed interval [a, bj, if 


b 
lim | Un (2)—# (2)P dz =0. 


Definition 2. A functional series is said to converge in the mean to 
a function S (x) on a closed interval (a, b] if the sequence of partial sums 
of that series converges in the mean to S (x) on fa, b). 

Remark. [t follows from the definitions that if a sequence (or a 
series) converges in the mean to f (z) on the entire interval [a, b], 
that sequence (or that series) converges in the mean to f (x) also on 
any closed interval [c, d] contained in [a, bd]. 

Now we wish to show the relation between convergence in the 
mean and uniform convergence of a sequence. 

We first prove that if a sequence {f, (x)} converges uniformly to 
f (x) on {a, 6], then {f, (x)} converges aiso in the mean to f (x) on {a, 6). 


Take an arbitrary ¢ >> 0. Givena positive number V ois , by 
uniform convergence there is N such that 


Lin @—KOl<V agp (1.32) 


for all x in [a, b] and alln SUN. 
By (1.32), for all n>WN 


b b 
| th @—f Parca. | dz=z<e, 


a 


i.e. {f, (x)} converges in the mean to f (z) on la, 0). 

Now we shall show that convergence of a sequence on some closed 
interval in the mean does not imply not only uniform convergence 
that interval but also come ;ce at least at one point of the ° 
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Consider a sequence of closed intervals J,, J,,... belonging to 
{0, 1] and having the following form: 


[= (0, 4}, 


fm [O 4]. tone 4] 


ee ¢- ee * @®© #* oe ee «© ®* e® @® ee 6 


We define the nth element of the sequence as follows 
oo 4 on /,, 
In (2) = { O at the other points of [0, 4]. 


The sequence we have constructed converges in the mean to f (x) = 0 
on [0, 4]. 


Indeed, 

i 

(fn (2)—OPdz= | fi (2) de= | dr= 
0 I, i, 


=length of closed interval J, -» 0 (as n —- oo). 


At the same time the sequence converges at no point of {0, 4). 

Indeed, whatever point 2, of [0, 1] we may take, among arbitrarily 
large n we can find both such for which J, contajns a point Z, (for these 
integers f, (t)) = 41) and such for which J, does not contain z, (for 
these integers jf, (z,) = 0). Thus {f, (75)} contains infinitely many 
elements, both equal to unity and zero, i.c. this sequence diverges. 

It turns out that convergence of {7, (z)} to a limit function f (z) 
on [a, b}] in the mean ensures that {/, (z)} can be integrated over 
fa, b] clement by element. 

Theorem 1.41. If a sequence {f, (x)} converges in the mean to a 
function f (z) on a closed interval {a, b), then that sequence can be in- 
tegrated over la, b] element by element, i.e. 

b 
lim | fe (x) dz 


Nw™w © 
Cc 


! 


f (x) de. 


ezists and is cqual to 


aa ie, 
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We first prove the following Jemma. 


Lemma 1. For any functions f (x) and g (x) integrable on’ a closed 
interval {a, b} we have the inequality 


Hf reetae|<V/ [ meear| @teae (1.33) 


called the Cauchy-Buniakowski inequality. 
Proof of Lemma 1. Consider the following quadratic trinomial in }: 
b 


| Uf (2) Ag (2) Pde 


a 5 > b 
= | f2(a) de—20 | f(a) g(a) da-+A? | g(2) deS0. 


Since this trinomial is nonnegative, it has no distinct real roots. But 
then its discriminant is nonpositive, i.e. 


b > , 
( \ i (x) g (z) dx)" — { ig (x) dr \ g2 (x) dz<0. 


Thus the lemma is proved. 
Proof of Theorem 1.11. Using inequality (1.33) for g (x) =1 
we have 


| fn (x) dx— f(x) dz =| [fn (2) ~f (2)) dex < 


<V J thie) —f (tae | ae = 


a a 


ye 
-V 6-0 | [fn (2) —f (@)Pdz > 0 
(as nm —» co), The theorem is proved. 


1.3. EQUICONTINUITY OF A SEQUENCE OF FUNCTIONS. 
THE ARZELA THEOREM 


Suppose each of the functions f, (x) is defined on some closed 
interval {a, 8]. 


Definition. A sequence of functions {f, (x)} is said to be equicon- 
tinuous on [a, b] if given any & > 0 we can find 6 > 0 such' that 

l fn (2) —fa (2) | <e 
3% 


ao Fandamentals of Mathemecatical Analusts 


holds for all n and for all points x' and x” of {a, b) related by 
fr’ — 2" {[<s. 


Remark 1. It is immediate from the definition that if {f, (z)} is 
equicontinuous on [a, db}, then so is any of its subsequences. 

We prove the following remarkable statement. 

Theorem 1.12 (Arzela). [fa sequence of functions {f, (x)} is equicon- 
tinuous and uniformly bounded on a closed interval [a, b], then we 
can choose a subsequence uniformly converging on [a, Db). 

Proof. Consider on [a, 5} the following sequence of points {x,)}: 
take as z, the point bisecting [a, bJ, as z. and x, the two points quar- 


Fig. 4.3 


tering, togother with z,, the interval [a, b] (Fie. 4.3), as 2,, 25, 2», and 
z; the four points dividing, together with z,, z,, and zy, tho interval 
fa, b] into eight equal parts (Fig. 4.3) and so on. 

Tho sequence {z,} we have constructed has the following property: 
whatever 6 > Owe take, we can find for it 2) such that on any interval 
of [a, b) of longth § there is at Jeast one of the clemonts 2,, 2,,... 
weg Ey” 

Now wo proceed to choose a subsequence of {f, (z)} uniformly 
convorging on [a, 0]. First consider {f, (z)} at z,. We obtain a bounded 
number sequence {f, (z,)}; by the Bolzano-Woierstrass theorem (seo 
Section 3.4 of [1]) we can choose a convergent (or regular) subsequenco 
of {f, (z,)} which we designate 


fay (21) fae (Za). - or fan (%)y eo 
Thon consider the sequence 
far (2): fae (Bye or Tin 


at z.. By the Bolzano-Weierstrass theorem we can choose a con- 
vergent subsequence which we designate 


fay (He), fee (Lede s - +> Fen (Ze), -- 
Thus the sequence 
fos (2) fan (Z)1 e+ 0s fan (2)... (1.34) 


is convergent at both z, and z,. 
Next consider (1.34) at zs and chooso a convorgent subsoquence 


Is: (Zs)1 Te (Zs) - + +1 Fan (23), oo 


* A sequence baving this property is said to be everywhere dense on (a, b}. 
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Continuing our arguments in a similar manner we obtain an infinite 
number of subsequences 


ie (z), hte (x), fig (2), oeey tin (2), eee 
fos (&)y foo (©), fog (@)y «+ +5 fon (2), +» 
fs1(2), fon (2), fsa (2), -- +s fan (X),--- 


e e® 8 8@® @ @ @®© &® @ #$@ @  @ @ j@ @ $.& $.@ @  @®  @® 


._ ee 8 &@ @ 8 @© @® @ @ @ 6 e@ @  j@  $@  @ $@  j.@  @ @ 


' the subsequence in the nth row being convergent at each of the points 
XZ, Lo, -. 
Now consider what is called a “diagonal” sequence 


hy (x), foe (z), o 8 09 Tan (x), cee 


We shall prove that this sequence uniformly converges on [{a, 0). 
To abbreviate the notation in what follows we shal] designate this 
(as well as the original) sequence as 


hy (x), te (x), ee ef Th (x), a ee 


(i.e. we shall use a single index instead of the double one). Take an 
arbitrary s > 0. 

Since the diagonal sequence is equicontinuous on fa, bj, given 
€ > 0, there is 5 > 0 such that whatever two points x and Zp, of 
fa, b] related by |z — 2m | << 6 we may take, for all n 


| fr (t)—fh (Zm) | < = (1.35) 


Noticing this fact we divide [a, b] into a finite number of closed in- 
tervals of length smaller than 6. We choose a finite number nz, of the 
first elements 21, %,,..., Xn, Of {z,} large enough for eack of the closed 
intervals to contain at least one of the points x1, Xo, . . ., Xn, 

Obviously the diagonal sequence converges at each of the points 
214, Xq, -» +, Ln, Given the above & > 0 therefore, we can find N 
such that 


| frp (2m)—In (2m) |< (1.36) 


for all n > N, all natural p and all m = 1, 2, 

Now let x be an arbitrary point on [a, db}, it would lie in one of the 
above closed intervals of length smaller than 6. For this x therefore 
there is at least one point z,, (m is one of the integers equal to 
1, 2, ..., Mp) Satisfying the condition | z — z,, |< 6 


re) 
1?) 
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Since the modulus of the sum of threo terms is not greater than the 
sum of theirtmoduli, we can write 


| frep (x)—f,, (xr) j<| fetp (z)— ftp (Z;,) | > 
++] faep (I) — fF (Zr) | + | I, (tm)—Fn, (z) [. (1.37) 


We evaluate the second term on the right-hand side of (4.37) 
by means of inequality (1.36) and to evaluate the first and the third 
term we use the fact that |z—z, {<< 6 and inequality (4.35) 
holding for any n (and therefore for any n + p). 

We finally have that given an arbitrary e > 0 there is V such that 


ftp (2) — fp (2) |<e 
for all 2 S> N, all natural p and any point z in [a, b}]. Uniform con- 
vergence of the diagonal sequence is proved. Thus Theorem 1.12 is 
proved. 

Remark 2. In the Arzela theorem, instead of uniform convergence of 
{f, (z)} on fa, b) it is sufficient to require boundedness at least ai one 
point of the interval. Indeed, the following stalement is true: if a 
sequence {f, (z)} is equicontinuous on a closed interval [a, b] and bounded 
at least at one point Xp of the interval, then it is uniformly bounded on 
[a, bv]. To prove this we notice that by the definition of equicontin- 
uity, given © = 4 we can find 6 > O such that the oscillation of any 
function f, (z) on any closed interval not greater than 6 does not 
exceed ¢ = 1. Since the entire interval [a, b] can be covered with a 
finite number m, of closed intervals of Jength not greater than 4, 
the oscillation of any function 7, (x) on the entire interval [a, b] does 
not exceed the number n>. But then the inequality | fx (%)) |< A 
expressing the boundedness of {f, (z)} at xz implies the inequality 
lf, (zc) | <A + xn, true for any point z of [a, b] and expressing the 
uniform boundedness of the sequence under consideration on [a, 8]. 

Remark 3. We establish a sufficient test for equicontinuity: if 
a sequence {f,, (x)} consists of functions differentiable on a closed interval 
[a, b] and if the sequence of derivatives {f;, (x)} is uniformly bounded on 
that interval, then {f, (x)} is eguicontinuous on [a, 0). 

To prove this take on [a, b] two arbitrary points x’ and zx” and 
write for f, (z) on [z’, z”] the Lagrange formula (see Section 8.9 


of [1]). 
By the Lagrange theorem, there is a point —, on [z’, z”J such that 
L fn lz" )—fn (2") |=1 fa (En) | | 2° — 27 |. (1.38) 


Since {/; (z)} is uniformly bounded on fa, b], there is a constant A 
such that for all n 


lin (Sn) | SA. (1.39) 
Substituting (41.39) into (1.35) we get 
L fn (29 —fn (2°) SA fz — 2" |. (1.40) 
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Choose any ¢ > 0. Then, taking 6 = e/A and using (1.40), we get, 
for all nm and for all x’ and x” of {a, b] related by | 2’ — 2” {[<4, 
the inequality 


| fn (2) — fa (2") | <e. 
The equicontinuity of {f, (x)} is thus proved. 


sin nz 


As an example consider a sequence { i. This sequence is equicon- 


tinuous on any closed interval [a, d], for the sequence of derivatives {cos nz} 
is uniformly bounded on any closed interval [a, }). 


Remark*4, The concept of equicontinuity can be formulated not 
only with respect to the closed interval [a, 6}, but also with respect 
to the open interval, half-open interval, half-line, an infinite straight 
line and{withjrespect to any set dense in itself in general*. Moreover, 
this notion may be applied not to the sequence of functions but to 
any infinite number of functions, 


1.4. POWER SERIES 


1.4.1. A power series and itsdomain of convergence. A power series 
is a functional series of the form 


Ao +S Ay," = Ay a,x dor? +... ape "+... (1.41) 


Where Gp, @1, Go, ++) Qn, + + are Constant real numbers called the 
coefficients of the series (1.41). Let us try and see what the domain of 
convergence of any power series looks like. 

Notice that any power series converges at a point x=Q, there 


existing pover series converging only at that point (for example, 
the series >) k!-z* does). 


k= 
Form the following number sequence using the coefficients a, of 
(1.44): 


(/Tanl}) (n=1,2,...). (1.42) 


There may occur two cases: (1) the sequence (1.42) is unbounded; 
(2) (1.42) is bounded. 

In case (2) the sequence (1.42) has a finite upper limit (see Section 
3.4.3 in [4]) which we denote by LD. It should be stressed that the 
upper limit Z is automatically nonnegative (for all the elements of 
(4.42) and therefore any limit point of this sequence are nonnegative). 


* The Arzela theorem remains valid if we replace the closed interval [a, b] 
by any closed bounded set in the theorem, 
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Summarizing we come to the conclusion that the following three 
cases are possible: (I) the sequence (4.42) is unbounded; (IJ) (4.42) 
is bounded and has a finite upper limit L > 0; (ITY) (4. 42) is bounded 
and has an upper limit ZL = Q. 

Now we prove the following remarkable statement. 

Theorem 1.1.3 (Cauchy-Hadamara). 

I. I} the sequence (4.42) is not bounded, then the power series (4.41) 
converges only when x = 0, 

Il. If the sequence (4.42) is bounded and has an upper limit L > 0, 
then the series (4.41) absolutely converges for values of x satisfying the 
inequality {x |< 4/L and diverges for values of z satisfying |x |> 
> U/L. 

IYI. ff the sequence (1.42) is bounded and its upper limit L = 0, 
then the series (1.41) absolutely converges for all values of x. 

Proof. 

J. Let (1.42) be unbounded. Then for z = 0 so is the sequence 


{x f- Ww {ay | = { @y°2" ? 
i.c. this sequence has elements with arbitrarily large n satisfying 
yy Q,°x"\o>>1 or fa,-z"| > 1. 


But this means that the necessary condition of convergence fails for 
the series (1.44) (with z 54 0) (see Section 13.4.2 in [4]), i.e. (4.44) 
diverges when z = 0. 

IT. Let the sequence (1.42) be bounded and let its upper limit 
L > 0. We prove that (1.41) absolutely converges when | z | < 4/1 
and diverges when |x | > 4/D. 

(a) First choose any = satisfying {az |< 4/D. Then there is ¢ > 0 
such that Jz]< < AL +e). By the properties of the upper limit 
all elements 7/ a, | beginning with some n satisfy 


ae { ap, |< b+. 


Thus, beginning with this n, 


L--— 
Y Yana | =| 14/Tanl Mpa <1, 


i.e. the series (1.441) absolutely converges by the Cauchy test (see 
Section 13.2.3 in [4]). 

(b) Now choose any z satisfying | x [> 1/Z. 

Then there is e >0O such that Jz}> 1/(1—e). By the dehnition 
of the upper limit we may choose a subsequence {"}/ Tan, 1} (p= 


=1, 2, ...) of the sequence (1.42) converging to ZL. 
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But this means that, beginning with some k, 
L—e<'¥ [an, | <L+e. 


Thus, beginning with that k, 
ny ees n _ 
VY lant | =|21+ Tan, | >foe=1 


or 
| a@n,-2"* | >4, 


i.e. the necessary condition for the convergence of the series (1.41) 
fails and the series diverges. 

IIt. Let the sequence (1.42) be bounded and let its upper limit 
L = 0. We prove that (1.41) absolutely converges for any 2. 

We choose an arbitrary z = 0 (when z = 0 the series (1.41) auto- 
matically converges absolutely). Since the upper limit Z = 0 and 
the sequence (1.42) cannot have negative limit points, the number 
EL = 0 is the only limit point and therefore it is the limit of that 
sequence, i.e. (4.42) is infinitesimal. 

But then for any positive number 41/2 | z | there is an integer begin- 
ning with which 


TT 4 
7 Jay | <ZE7° 
Therefore, beginning with that integer, 
— —— 4 
v | Anz” |=|2|-7/] ay, I<z<i, 


i.e. (1.41) absolutely converges by the Cauchy test (see Section 
43.2.3 in [1]). This completes the proof of the theorem. 

The theorem leads to the following fundamental statement. 

Theorem 1.14. For every power series (1.41), unless it is one converging 
only at a point x = 0, there is a positive number R (possibly equal to 
infinity) such that that series absolutely converges when |x |< R and 
diverges when |x |> RR. 

The number R is called the radius of convergence of the series and 
the interval (—R, R) is the interval of convergence of the series. The 
radius of convergence is computed from the formula 


——____1 (1.43) 


(when lim V/ ]a,|=0, R=oo). 


Tl CO 
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Remark {, At the end points of the interval of convergence, i.e. at 
the points r = —R and z = R. a power series may be both con- 
verzent and divereent™. 


~«< 
Thus the series 1+ S z* has a radius of convergence equal to 
ti 
unity and the interval of convergence of the form (—1, 1) and 
converges at the end points of the interval. 
. = xh . 
The series > =x has the same interval of convergence, (—1, 4), 
K-- 
but converges at both end points of the interval. 

Remark 2, All the results of this subsection hold for a series (1.41) 
in which the real variable z is replaced by a complex variable <«. 

For such a series we can establish the existence of a positive 
number /? such that the series absolutely converges when |s |< R 
and diverges when | 2 {> /?. 

To compute J? we use formula (1.43). The number 72 is called the 
radius of convergence of the series and the region {2 {< # is the 
circle of convergence of the series. 

1.4.2. Continuity of the sum of a power series. Let a power series (1.41) 
have the radius of convergence R > 0. 

Lemma 2, Given any positive number r satisfying the conditionr <Q, 
the series (1.41) uniformly converges on a closed interval [—r, r}, i.e. 
when [ajl<r. 

Proof. By Theorem 1.14 the series (1.41) absolutely converges 
when wv =r, i.e. 


03 


lag|-+ » fa, er’ 


converges. But this number series serves as a majorant of (1.41) for 
all z of [—r, r]. By the Weierstrass test, (1.44) converges uniformly 
on [—r, r]. Thus the lemma is proved. 

Corollary. Under the hypotheses of Lemma 2 the sum of a series (1.41) 
isa function continuous on a closed interval [—r, r] (by Theorem 1.7). 

Theorem 1.135. The sum of a power series is a continuous function in 
the interval of convergence of the series. 

Proof. Let S (x) be the sum of a power series (1.41) and Iet 
be its radius of convergence. Wo take any z in the interval of con- 
vergence, i.e. such that |z |< #. Thoro is always a number r such 
that |z |<r<( RR. By the corollary of Lemma 2, S (zx) is con- 


* Note the following theorem of Abel: if the power series (11) converges when 
=z = f, then its sum S (z) ts continuous at the point R from the left. We may assume 
without loss of generality that 2 = 1, but in this form Abel’s theorem (asserting 
in fact the jresularity of the Abel-Poisson summation method) is proved in 
Supplement 3 to Chapter 13 in [fl]. 
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tinuous on [—r,r]. Therefore S (x) is continuous at z too. Tho theorem 
is proved. 

1.4.3. Term-by-term integration and term-by-term differentiation 
of a power Scrics. 

Theorem 1.16. If R >0 is the radius of convergence of a power series 
(1.41) and x satisfies the condition |z |< R, then (1.41) can be in- 
tegrated term by term over the closed interval [0, x]. The resulting series 
has the same radius of convergence R that the original series has. 

Proof. For any = satisfying the condition | z ]< R we can find r 
such that Jz [<r< FR. By Lemma 2, (1.41) converges uniformly 
on [—r, r] and hence on [0, z]. But then, by Theorem 1.8, that sories 
can be integrated over [0, z] term hy torm. 

Integrating term by term yiolds a power series 


whose radius of convergence, by Theorem 1.14, is the inverse of the 
upper limit of the sequence 


nfl 7 OSC TT 
j/ Laace a Vener (1.44) 
rt vn i 


Since the uppor limit of (1.44) is the same that (1.42) has*, the 
theorom is provod. 

Theorem 1,17. A power series (1.41) can be differentiated term by 
term in ils interval of convergence. The resulting series has the same 
radius of convergence R that the original series has. 

Proof. It is sufficient (by Theorem 1.9 and Lemma 2) to prove 
only the second statemont of tha theorem. 

Differentiating (1.44) term by term yiolds a serics 


O,+-2-a,-2- 2... $n-dg 2 I+ (n $1) edgy a 7+... 


whoso radius of convergence (by Theorem 1.44) is the inverso of the 
upper limit of the sequence 


(7 (2-4) | Gnas |}. (1.45) 


. on 11 pe mee Tt f 
* For limy/n=4, lim Jan] = lim” ~YTant= 
Ti Nero Tl 00 
n 
= lim [fan] ] "bt? =lim [7/{an I J 
1-00 % co 
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Since (4.45) has the same upper limit that (1.42) has*, the theorem 
is proved. 

Corollary. A potcer series can be differentiated term by term as many 
limes as we like in its interval of convergence. 

The series resulting after n-fold term-by-term differentiation of the 
original series has the same radius of convergence that the original series 
has. 


1.5. POWER SERIES EXPANSION OF FUNCTIONS 


1.5.1. Power series expansion of a function. 

Definition 1. We shall say that a function f (x) can be expanded into 
a power series on an open interval (—R, R) (on a set {x}) if there is 
a power series converging to f{ (xz) on (—R, R) (on {z}). 

The following statements are true. 

1°. For a function f (xz) to allow expansion into a power series on an 
open interval (—R, FR), tt is necessary that it should have continuous 
derivatives of any order on (—R, R)**. 

Indeed, a power series can be differentiated term by term as many 
times as we please in its interval of convergence which in any case 
contains the open interval (—2, A), all the resulting series con- 
verging in the same interval of convergence (Theorem 1.17). 

But then the sums of the series obtained by differentiating as 
any times as we please (by Theorem 1.15) are functions continuous 
in the interval of convergence and are therefore continuous on 
(—R, R). 

2°, Jf a function f (x) can be expanded into a power series on an open 
interval (—R, R), this can only be done in a unique way. 

Indeed, let f (7) be a function that can be expanded on (—f, FR) 
into a power series (4.44). 

Differentiating (4.41) term by term mn times (which can trivially 
be done in the interval (—R, R)) we get 


f (2) = agen! 4- yay? (n+ IW) az! 4... 


5 . nw —s Nn 
* For lim y’n-- 1:4, Jim VY lena | = 
Nhe co yi 00 
i a 
=fJim 7 |enl= lim [77 fan| jy™-! — lim 4/7 {az |}. 
Tio 0 Tle oe Theo ox 


** Note that there are functions that have continuous derivatives of an 
order on (--R, ft) but cannot be expanded into a power ‘serics on that interval. 
As an illustration 


-Msx* when - =: 0 
4 x =: 0, 
(z= { 


0 when ---0, 
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From which for 7 = O we find 
f™ (0) = an -n! 
or 


__ 17 (0) 
no ml ° 


(4.46) 


Thus the coefficients of (1.41) into which we can expand f (zx) are 
uniquely defined by formula (1.46). 

Suppose now that f (x) has continuous derivatives of any order on 

—R, RB). 
Definition 2. A power series (1.41) whose coefficients are defined by 
formula (1.46) are called a Taylor series of the function f (2). 

Statement 2° leads us to the following statement. 

3°. If a function f (x) can be expanded into a power series on an open 
interval (—R, R), that series is a Taylor series of the function f (x). 

In conclusion we formulate the following statement immediate 
from Section 8.414 of [4]. 

4°, For a function f (x) to allow expansion into a Taylor series on an 
open interval (—R, R) (on a set {x\) it is necessary and sufficient that 
the remainder in the Maclaurin formula for that function should tend 
to zero on (—RR, R) (on {z}). 

1.5.2. Taylor expansion of some elementary functions. It was 
proved in [1] (see Section 8.15.2) that the remainders in the Maclaurin 
formula for the functions e*, cos z, and sin x tend to zero on the 
entire infinite straight line and that the remainder in the Maclaurin 
formula for the function In (1 + x) tends to zero on the half-open 
interval —1 <2 < -+1. 

By Statement 4° of the preceding subsection this brings us to the 
following expansions: 


ete 3S. 


n=1 


4)n man 


cost=41-+ >) a 
n=1 


. a (—1)” genti 
sint= 2) Gntin? 
n==D 
<1 (Anti an 
in(Qi+2z)= >) a 
n=1 
he first three of these converge for all values of x, whi] 
one ‘oes for the values of ein -—-1t<z¢< 1, ile the last 
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Now we discuss power series expansion of the function (1 -+- z)* 
or what is called a binomial serics. 

If f(z) = (1 =- 2)% then 

PO (7) = ala—i)(a— 2)... (@—n + 1)(1 + rer. 


The Maclaurin formula with remainder in the Cauchy form is Uhere- 
fore of the form (see Section 8.44 of [4]) 


TN 
(42)%=1+3 PME MED he Rar (2) (1.47) 
k=! 
where 


Rass (2)y= SS amt. 7749 (02) = 


n! 


= LOOP oto (a—1)...(a—n) (144+02)°°" = 
~ (2 )": Ce 2 (GAM) og (f+ Ox) 21 at (1.48) 


(0 is some number in the interval 0 < 0 < 1). 
We first show that when @ > 0 the remainder R,+, (z) tends to 
zero (aS n —» oo) everywhere on —i <2<cl. 


Indeed, the elements of {(7=3)"} are al] not grenter than 


(a—14) ems) - (2—n) 


unity anywhere on —1<2<1; is bounde 


ed for any given a>0"; the number ats 4-Oz)*-! is defined 
for any given a@>>0 and any zx in the interval] —1<2<+-1; 
finally the sequence {z"*!} is infinitesimal for any z in —1< 
<2z<l. 

Thus by (1.48) the remainder 2,4, (z) tends to zero for any given 
a> QO and anv zin —1<7<c i. 

By (1.47) therefore, when a@ => 0, everywhere on —1<Or<ti1 we 
have 


my a(a—1)... (abd 
Ce er eo eco (1.49) 
re | 
Now we prove that when @ > 0 the series on the right of (4.49) untformly con- 


verges to (1 =- z)© on the elosed Interval ~1 Kz 
Evers where on this interval the series is majorized by the following number 
series: 


- faet-[i—ow]... [sia | 
ns nee (41.50) 
Kez! 


© AN the elements of the scquence are bounded in absclute value by the 
(~— 1) (a2 — 2)... (2 —Ja}) 


fai , Where [ce] is the integral part of a. 


nuinher 
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By the Weierstrass test, to establish uniform convergence on —1 <2 <i 
of the series on the right of (1.49) it is sufficient to prove the convergence of the 
majorant series (1.50). 

Denote the kth term of (1.50) by p,. Then for all large enough’k‘we get 


Pray kK yg _ fm 
=F 1 FLT: (1.54) 


This yields 


«7 (Pha 
lim 4 (4 ? 


00 he 


. k 
J=U+a) Kim yoy sto > 4 


i.e. (4.50) converges by the Raabe test (see Section 13.2.5 of [4)). 

We have thereby proved that when « > 0 the series on the right of (41.49) 
converges uniformly on —1 <2z<1. It remains to prove that that series 
converges on —1 <2 <1 to (f + x)%. 

By what was proved above the sum of the series on the right of (1.49), S (z), 
and the function (1 -+- z)™ coincide everywhere on ~1 <z<01. In addition 


both functions, S (zx) and a -- x)”, are continuous on —1 <x <1 (the function 
S (x) as the sum of a uniformly convergent series of continuous functions; the 


continuity of (1 + z)* is obvious when a > 0). 
But then the values of S (x) and (4 + 2z)* atz = —1 and z = 1 must coin- 


cide, i.e. the series on the right of (1.49) uniformly converges to (4 -+ z)~ on 
—1<71<1 


1.5.3. Some elementary facts about the functions of a complex 
variable. As already noted above, we may extend to the case of 
a power series in a complex variable z 


Ap + A424 Goz*+- 10. fans" -+... 


Theorems 1.13 and 1.14 (on the existence and value of the radius of 
convergence). Series of this type are used to define the functions of 
a complex variable z. 

The functions e*, cos z, and sin z of z are defined to be the sums 
of the following series: 


@=i+ >) =, (4.52) 
n=1 
cosz=1-+ >! aS. (1.53) 
n=t1 
. ~ ~—4{ ogeontl 
sin 2 = > ea (1.54) 
n=0 ' 


It is easy to verify that these three series absolutely converge for 
all values of z (their radius of convergence 2 = 00). 

Now we establish the relation between the functions ¢*, cos z, and 
sin Z. 
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Keplacing < by is in formula (1.52) we get 


ena | ' ° we ‘ (iz)? 8 {iz}? , (iz)! to (iz)° t 
a OT | 
=(l-faane)ti(s-Ftap—-). (1.99) 


Comparing the right-hand side of (41.55) with the expansions 
(1.93) and (1.54) we arrive at the following remarkable formula: 


e* = cos 2 ++ i-sins, (1.56) 


Formula (1.56) plays a fundamental part in the theory of functions 
of a complex variable and is called L£uler's formula. 

Setting in Euler's formula the variable = first equal to a real z 
and then to a real —r we obtain the following two formulas: 

e* = cosx -i-sinz, e?* = cosz —isina. 
Adding and subtracting these we obtain formulas expressing cos z 
and sin z in terms of an exponential function: 


efX iL ecdx 
cos r= —————-,, 
efx—eclx (1.57) 
sn t= —-—_.. 
2i 


In conclusion we shall discuss the definition of the logarithmic function 
uw = Inz of the complex variable <. It is natural to define this function as the 
inverse of the exponential, i.e. by the relation < = ec. Putting w = u + ty, 
z= =z-- (y We intend to express xv and vin terms of 2 = x -f- ly. 

From the relation 


sax te iy = eutiv = eu (cos uv 4- i sin v) 


we get, using the concepts of the modulus and argument of a complex number 
(sce formula (7.6) in {1]), 


Jjeft=< zip? er, arg s==v—2mk, 
where 
A= 0, 24, 3:2, 2... 
From these equations we find that 
nozIn|s[=In Js? y?, 
Uearged-20k |= (4: =0, &1, 2 2,...) 
or, finally, that 
Ine = In] s[ + 7? (args + 20k), where & =: 0, = 1, +2,.... (1.55) 


Formula (1.58) shows that the logarithmic function ts not single-valued in the 
complex domain: its Imaginary part has an infinite number of values corres- 
ponding to different k = 0, +1, +2, ... for the same value of <. 

It is easy to realize that a similar situation will arise when inverse trigono- 
metric functions are defined in the complex domain. 
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1.5.4. Uniform approximation of a continuous function by poly- 
nomials (the Weierstrass theorem). In this subsection we shall prove 
ine a undamental theorem due to Weierstrass who established it in 

Theorem 1.18 (Weierstrass). If a function f (x) is continuous on a 
closed interval [a, b], then there is a polynomial sequence {P, (x)} 
uniformly converging on {a, b] to f (x), i.e. given any & > 0, there is 
a polynomial P, (x) with an integer n dependent on & such that 


|P, (@) —f@)|<e 


at once for all x in [a, Db). 

Restated, a function f (x) continuous on a closed interval la, b) 
can be uniformly approximated on {a, b] by a polynomial with a preas- 
signed accuracy &. 

Proof. Without loss of generality we may consider a closed interval 
[0, 4] instead of [a, b]*. Moreover, it is sufficient to prove the theorem 
for a continuous function f (x) vanishing at the ends of [0, 14], i.e. 
satisfying the conditions f (0) = 0 and f (1) = 0. Indeed, if f (z) 
did not satisfy these conditions, then on putting 


g (x) = f @) —70)—<lf 4) —F (0)! 


we should obtain a function g (x) continuous on [0, 4] and satisfying 
the conditions g (0) = 0 and g (1) = 0, and it would follow from 
the possibility of representing g (x) as the limit of a uniformly con- 
vergent sequence of polynomials that f (x) can also be represented as 
the limit of a uniformly convergent sequence.of polynomials (for 
the difference f (x) — g (z) is a first-degree polynomial). 

So let f (x) be a function continuous on [0, 4] and satisfying the 
conditions f (0) = 0, f (1)=0. We may extend such a function to 
the entire infinite straight line by setting it equal to zero outside 
[0, 4] and state that a function extended in this way is uniformly 
continuous on the entire infinite straight line. 

Consider the following particular sequence of non-negative poly- 
nomials of degree 27: 


Qn (x) =c, (1—2")” (n=1, 2,...), (4.59) 
each having a constant c, chosen so that 
1 
| Qnz)do=t (n= 4, 2,044). _ (4.60) 
“4 
We majorize c, without computing its exact value. — 


* Since one of the intervals transforms to the other by linear substitution 
‘go (b—ai+a. “ ; , 


4—01684 


5D Fundementats of Meathematicel Analysts 


To do this we notice that for any n = 1, 2,... and all x in (0, 4] 
we have the inequality* 


(4 — x*)" > 1 — nx. (1.64) 
Using (4.64) and the fact that {/]/n <1 for any nD 1 we havo 
Uva 
\ (1—2°)"dr=2 | (4—2°*)"dzr>2 (1—2x*)"dzrS> 
={ 0 0 

Vn , 4 ; 
> 2 (1— nx?) (ESE Va (1.62) 


From (1.59), (1.60), and (41.62) wo derive that given any xn = 
=1, 2, ... the following upper estimate holds for ¢,: 


t, < Vn. (4.63) 

From (4.63) and (1.59) it follows that given any 6 > 0, for all z 
in ba xr<ai we have 

0<0n (2) Vn (1—82)" (4.64) 

From (1.64) we deduce tha given any fixed 5 > 0 the sequence of 
nonnegative polynomials {Q, (x)} converges to cero uniformly on 
§<2x<it**. 

Now for any zinO <x <1 we put 


Prlz)=\ f (x+1) Qn (t) dl (1.65) 


i 


VC, mem 


and show that given any n = 14, 2, ... the function P, (z) is a 
polynomial of degree 2n, with {P, (z)} the desired sequence of poly- 
nomials uniformly converging on O< 2 <1 to f (zx). 


* This inequality follows from the fact that for any n> 1 the function 
p (x) = (1 ~— 24)" — (1 — nz?) fs nonnegative everywhere on OS x <4, for it 
vanishes when xz == 0 and has a nonnegative derivative @’ (x) = 
<= 2nz [tf — ({ -~ x°)"-)) everywhere on Oz <1 

** Indeed, it suffices to prove that the sequence a, = (1 — 62)". Wn cor 
verges to zero, and this follows. for example, from the fact that since 


lim >a, == (1—8%) lim ni/en (1-8 <i, 
Nhe 


lwo 


the serics “ ay, converges by the Cauchy test (sce Theorem 13.6 in {4}. 
ne § 
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Since f (x), the function under study, is zero outside [0, 4], for 
any x of [0, 4] the integral (1.65) can be written as 
i-x 


Pa(z)= | F(e+t) Qn (t) dt. 


=X 


Replacing t by t — z in the Jast integral we get 


1 
P, (x)= \ f(t) On (t—2) dt. (1.66). 
0 


It is clear from (4.66) and (41.59) that P, (x) is a polynomial of 
degree 27. 

It remains to prove that {P,, (z)} converges to f (z) uniformly on 
0O<zr<i. 

We take an arbitrary ¢« > 0. For given «, by uniform continuity 
of f (x) on the entire infinite straight line, we can find 6 > 0 such that. 


| f(2)—f(y) |< when [z—y| <6. (1.67) 


We further notice that since f (x) is continuous on [0, 1], it is 
also bounded on that interval] and therefore everywhere on the infinite 
straight line. This means that there is a constant [A such that for 
all z 


lf@ IS (1.68) 


Using (1.60), (4.64), (1.67), and U. 68) and the non-negativeness of 
Q (x) we evaluate the difference P, (x) — f (2). 
For allzinO<a#< if we have 


i 
| Pa (z)—T (a) [=| | if @+0—F Qn ( at] < 


i —6 
<J If (e+1)—F(2)1 Qn (t) dt<2A | Qn (#) dt+ 
~- -1 


1 


6 1 
++- | Q,(@dt+24 | Q,@a< 
—§ i) 


<4A Vn. (4—82)" + AIGINEETING, Liathe 


Ae ect Os - eae VAP 


es 
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To complete the proof of the theorem it suffices to notice that for 
all large enough n 


As |e (1-Sy"<S. 


Corollary. If not only the function f(z) tiself, but also iis derivatives 
wp to some order k inclusively are continuous on a closed interval (0, 1}*, 
then there is a sequence of polynomials {P, (x)} such that cach of the 
sequences {P,, (z)}. {Pa (z)}, ..., {PY (z)} converges uniformly on 
{0, 4] to jf (x), f (x), . . «y {0 (x) respectively. 

Indeed, we may assume without loss of generality that each of the 
functions f (x), f’ (z), ..., f(z) vanishes when x = — and when 
x= 1%", and under such conditions f(z) can be extended to the 
entire infinite straight line by setting it equal to zero outside [0, 4], 
so that the extended function and all of its derivatives up to order i 
inclusively turn ont to he uniformly continuous on the entire infinite 
straight line. 

But then, denoting by P,, (x) the same polynomial (1.65) as above 
and repeating the arguments used in proving Theorem 1.18, we 
establish that each of the differences 


Pr(r)—f(z), Palzy—f’ (2), 020 PH (z)— J (2) 


is an infinitesimal uniform in zt onO<2<1. 

Remark 41, The above proof can be easily generalized to the case 
of a f{unetion of m variables f (7), Te, . . -s 2m) continuous in an m-di- 
mensional cube O< 7, <4 (i = 1, 2, ..., m). 

In full analogy with Theorem 1.18 it can be proved that for such 
a function f (2, fa, .. -: 2m) there is a sequence of polynomials in m 
variables 2, 2a, . . ., 2m uniformly converging to it in the m-dimen- 
sional cube. 


Remark 2. Notice that the polynomials of Theorem 1.18 may be replaced by 
functions of a more general nature while retaining the statement about the 
eet ata of uniform approximation by such functions of any continuous 

unction f. 

Let us agree to say that an arbitrary collection A of functions defined on same 
set £ is an algebra if*** (1) f + ¢ € A; (2) f-e € A; (3) @-f EA for arbitrary 
J€A and g€A and for any real a. 

In other words, an algebra is a collection of functions closed under addition 
and gnultiplication of functions and under multiplication of functions by real 
numbers. 


* Of course we may take [a, b] instead of [0,f]. 
** If f (x) failed to satisfy these conditions, we should find a polynomfal 


Py (x) of degree 2k such that these conditions would hold for g¢ (z) = f(z) — 


— Pr (x). ' 
*#* Wo recall that the symbol 7 € A means *f helongs to A. 
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If for each point z of E there is some function g € A such that g (x) = 0, the 
algebra A is said to vanish at none of the potnts x of E. 

A collection A of functions defined of a set E is said to separate the points oj E 
if for any two distinct points x; and z, of the set we can find a function of A 
such that f (x) 4 f (zo). 

We come to the following remarkable statement called the Stone-Weiterstrass 
theorem*. 

Let A be an algebra of functions continuous on a compact** set E that separates 
the points of E and vanishes at no point of that set. Then every function f (x) con- 
tinuous on E can be represented as the limit of a untformly convergent sequence of 
functions of A. 


* M. Stone is a modern American mathematician. 
** Recall that a compact set is a closed bounded set 


CHAPTER 2 


DOUBLE INTEGRALS AND n-FOLD 
MULTIPLE INTEGRALS 


In Volume 1 we discussed physical and geometrical problems 
leading to the concept of single definite integral. 

Typical problems of this kind are the problem of calculating the 
mass of an inhomogencous rod from the known linear density of the 
rod and the problem of calculating the area of a curvilinear trapozoid 

{i.e. the area under the graph of a nonnegative 


z| function y = f (z) on a closed interval [a, b)). 
ze f(x,y) It is easy to citesimilar “multidimensional” 
problems leading to the notion of double or 


volume density p (47) of the body leads us in 
a natura] way to the concept of triplo inte- 
gral, 
To compute the mass of the body 7’, divide 
—-——___. Tinto sufficiently small parts 7,, T., ...,Tn. 
x Wo may roughly assume the volume density 
Fig. 2.4 p (Af) of each part 7), to be constant and equal 
to p (A7;,), where £7; is some point of 7,. In 
this case the mass of each 7), will be approxi- 
mately equal to p(A7;,)-v,, where v, is the volumo of 7. 
The approximate value of the mass of the entire hody 7 is 
Tt 


2, 2 (M a) de. 


LI triple integral. 
Thus the problem of calculating the mass 
of an inhomogeneous body 7 from the known 
y 


It is natural to define the exact value of the mnss as the limit of the 
above sum as every part 7; decreases indefinitely.* We may take 
this Jimit as the definition of triple integra) of p (AJ) over a threc- 
dimensional region 7. 

In a quite similar way we may consider the geometrical problom 
of calculating the volume of the so-called curved-haso cylinder 
(i.e. the volume of the body (Fig. 2.1) lying under the graph of the 
nonnegative function 2 = f (z, y) in some two-dimensional domain D). 


Leen nena 


° Of course the words “decreases indefinitely” should be made more precise. 
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This problem leads us to the notion of double integral of a function 
j (x, y) over a two-dimensional domain D. 

In this chapter we present the theory of double, triple and n-fold 
multiple integrals in general. 

To make more effective use of the analogy of the single integral 
we first introduce the concept of double integral for the rectangle 
and only then do we introduce the double integral over an arbitrary 
domain using both rectilinear and absolutely arbitrary subdivision 
of the domain. 


2.1. DEFINITION AND EXISTENCE OF A DOUBLE INTEGRAL 


2.1.4. Definition of a double integral for the rectangle. Let f (x, y) 
be a function defined everywhere on a rectangle R = la << x <b] X 
X(ex<y<d) (Fig. 2.2). 

We divide the closed interval a < z < b into n subdivisions using 
the points a = m<274,<24,<...<2, = bandc<y<d into 
p subdivisions using the pointse = y,<ySye<...<y=d. 


QA @, Lg Ky Me Gy bd a 
Fig. 2.2 


Corresponding to subdivision by straight lines parallel to the z 
and y axes (see Fig. 2.2) is a subdivision of a rectangle R into n+p 
subrectangles 


Ryr= (pre) X (Yy Sy] 
(b=1,2,...,np L=4,2 «00. p). 


This subdivision of R will be designated by the symbol 7. 
Throughout what follows in this chapter, by “rectangle” we mean 
a recfangle with sides parallel to the coordinate axes. 
On every subrectangle Aj. we choose a point (€, m2). Putting 
Az, = Xp — Chey, AYt = Yr — Yi-1, denote by AR, the area of a 
rectangle Rr t Obviously ARny = Az», Ayt. 
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Definition 1. The number 


nop 
Gas ~ A(Ens tt) AL (2.1) 
is called the integral sum of a function f (x, y) corresponding toa given 
subdivision 7 of a rectangle R and a given choice of intermediate points 
(¢;, 4.) on the subrectangles of T. 

The diagonal] V’ (Az) + (Ay)? will be called the diameter of a 
rectangle 2?; ;. Thesymbol A will designate the Jargest of the diameters 
of all subrectangles 22; ;. 

Definition 2. The number I is said to be the limit of the integral 
sums (2.1) as A +0 if given any positive number ¢ we can find a 
positive number §& such that for A <4, regardless of the choice of points 
(€)5 7) on subreclangles Ry, we have 


lo—JJ<e, 


Definition 3.A function f(z, y) is said lo be integrable (in (he sense 
of Riemann) on arectangle R if there is a finite limit I of integral sums 
of f(z, y) as A 0. 

The limit J is called a double integral of f(z, y) over R and is designat- 
ed by one of the following symbols: 


T= \\ f(z, ydzdy=\f fcanas. 
7% R 

Remark. Just as for a single definite integral (see Section 10.1 of 
(4}), it can be established that any function f (x, y) integrable on a 
rectangle R is bounded on that rectangle. 

We are thus justified in considering only bounded functions 
f (x, y) in what follows. 

2.1.2. Existence of a double integral for the rectangle. Darboux s 
theory developed in Chapter 40 of {4] for the single definite integral 
can be extended in full to the case of the double integral in the 
rectangle J?. In view of close analogy we restrict ourselves to a general 
outline of the arguments. 

Let JJ,; and m,3; be the supremum and infimuin of a function 
f(z, vy) on a subrectangle 2); Form for the given subdivision 7 
of # two sums 

the upper one 


n ? 
S = » >» Myre AR 
Ken] fex{ 
and the lower one 
yo 
rm | 
§ =- S| my ARnt 


Nash led 
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The following statements hold (their proofs are quite similar to 
those given in Section 10.2.2 of [1]). 

1°. For any fized subdivision T and any & > 0 we can choose inter- 
mediate points (E,, N1) on subrectangles R;,, so that the integral sum o 
satisfies the inequalities O<0 S —~o<eé. 

Points (Ep, 42) can also be chosen in such a way that the integral sum 
satisfies the inequalities Do —s<eE. 

2°. If a subdivision T’ of a rectangle R is obtained by adding new 
straight lines to the straight lines generating the subdivision IT, then 
the upper sum S' of T’ is not greater than the upper sum S of T, and. 
the lower sum s’ of T’ is not less than the lower sum s of T, i.e. 


s<s', S'<S. 


~ 


3°. Let T’ and T” be any two subdivisions of a rectangle R. Then the 
lower sum of one of them does not exceed the upper sum of the other. 
That is, ifs’, S’ ands”, S” are respectively the lower and upper sums of 
IT’ ana T", then 


SxS", s"<S'. 


4°, A set {S} of the upper sums of a given function f (x, y) for all 
possible subdivisions of a rectangle R is bounded below. A set {s} of the 
lower sums is bounded above. 

Thus there are numbers 


I= int (S}, I=sup {s} 


called respectively the upper and lower Darboux integrals (of the 
function f(z, y) over R). _ 
It can easily be seen that J<J. 


5°. Let T’ be a subdivision of a rectangle R obtained from a sub- 
division T by adding to it p new straight lines and let s’, S’ and s, S be 
respectively the lower and upper sums of T" and T. 

Then for the differences S — S' and s’ — s we can obtain an estimate 
dependent on the maximum diameter A of a subrectangle of the sub- 
division T, the number p of the straight lines added, the supremum and 
infimum M and m of the function f (x, y) on R and on the diameter 
d of R. 

That is, 


S— S'<(M — m)-p-A-d, 
< ( 


s' — Ss M — m)-p-A-d. 


rr 
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6°, The upper and lower Darbouz integrals I and I of a function 


j (x, uv) over a rectangle R are respectively the limits of the upper and 
lower sums as A — 0*. 

The following main theorem results from properties 1° to 6°. 

Theorem 2.2. For a function f(x, y) bounded on a rectangle R to be 
integrable on R tt is necessary and sufficient that given any & > 0 
there should be a subdivision T of R for which § —s<e. 

As in Chapter 10 of [4], Theorem 2.4 in conjunction with the 
theorem on the uniform continuity of a function allows us to distin- 
guish major classes of integrable functions. 

Theorem 2.2. Any function} (z, y) continuous in a reciangle R is 
integrable on Rf. 

Definition 1. We give the name of an clementary figure fo a set of 
points that area sum ofa finite number of rectangles (with sides paraliel 
to the x and y azes)**., 

Defenition 2. We say that the function f(z, y) has the I-property in 
arectangle J (in an arbitrary closed domain D) if: (1) f (x, y) is bounded 
in?R (in D); (2) given any € > 0 we can find an elementary figure that 
contains all the points and lines of discontinuity of f (z, y) and has the 
area less than e. : 

Theorem 2.3. If afunction f (z,y) has the I-property in a rectan- 
gle R, then it is integrable on R. 

The proofs of Theorems 2.2 and 2.3 are quite similar to those of 
Theorem 10.3 and Theorem 40.4 in [4]. 

2.1.3. Definition and existence of a double integral for an arbitrary 
domain. In Section 11.2.4 of [1] we introduced tho squarability 
and area of a plane figure Q@. These notion can ha carried over without 
any modifications to the case of an arbitrary bounded sct Q of points 
in the plane. 

In all tho definitions and statements of Section 14.2.4 of [4] wo 
could take an arbitrary bounded set Q instead of the figure Q. 

In that section we also gave a definition of a curve (or the boundary 
of a figure) of zero area: Y is called a curve of zero area if given any 
& > 0 we can find a polygon that contains all points of T and has an 
area less than e. 


* The concept of the limit cfupperor Jower sums is defined in full analogy 
with the notion of the limit of integral sums, That is, the number J js sald to be 
the limit of upper sums S as A + 0 if given any © > 0 we can find 6 > 0 such 
that | S—J}<e for A< 64. 

** Notice thata sum of a finite number of absolutely arbitrary rectangles 
(vith sides parallel to the z and y axes) can be represented as a sum of an also 
finite number of rectangles having no interior points In common (with sides paral- 
lel to the above axes). In Definition 1 therefore we can take rectangles both 
with and without common interior points. 
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Note that in the definition “polygon” can be replaced by “elemen-~ 
tary figure”. This follows from the fact that any elementary figure is 
a polygon and any polygon with area less than e is contained in an 
elementary figure having an area less than a number 8e*. 

It is easy to prove the following statement. 

If T is of zero area and if the plane is covered with a square net with 
spacing h, then given any & > 0 we can find h > 0 such that the sum 
of the areas of all the squares having points in common with TI is less 
than &. 

Indeed, given any ¢ > 0 we can find some elementary figure Q 
containing J‘ inside itself and having an area Jess than e/4. After 
that it remains to notice that with 
the spacing of the square net h Y | 
sufficiently small, all the squares R 
having {points in common with [ 
are contained in the elementary 
figure resulting from replacing every 
rectangle @ by a twice as large rec- 
tangle with the same centre. 

It should be stressed that the class 
of zero-area curves is very vast. 
Belonging to this class, for example, 
is any rectifiable curve (see Theo- 
rem 44.3 in [4]). Fig. 2.3 

We now proceed to define the 
double integral for an arbitrary two-dimensional domain D. 

Let D be a closed bounded domain whose boundary I has zero 
area and let f (x, y) be a function defined and bounded in D. 

Denote by # any rectangle (with sides parallel to coordinate axes) 
containing the domain D (Fig. 2.3). 

We define in R the following function: 


f(z, y) at the points of D, 


H(z, =| O at the other points of R. (2.2) 


Definition. A function f (x, y) will be said to be integrable in the 
domain D if the function F (x, y) is integrable in the rectangle R. 


* Indeed, (4) a polygon is equal to a finite sum of triangles; (2) every triangle 
equals the sum (or difference) of two right-angled triangles; (3) a rig t-angled 
triangle is contained in a rectangle twice aslarge in area; (4) any rectangle equals 
the sum of a finite number of squares and one rectangle the ratio of whose sides 
is between 4 and 2; (5) any square is contained in a square twice as large in area 
and with sides parallel to the z and y axes; (6) any rectangle with ratio of its 
sides between 1 and 2 can be completed to a square and is therefore contained 
in a square four times as large in area and with sides parallel to the z and y axes. 
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The number J = \ | I (z, vu) dx du will be called a double 


e 


rE 
integral of f (7, v) over D and designated 


= ( \ f(x, ujdzdy= { \ { (Mf) do. 


D =3) 


Remark 1. From this definition it follows at once that | | 1dr dy 
D 
is equal to the area of D. Indeed, subjecting the corresponding rect- 
angle #2 to smaller and smaller subdivisions we find that the upper 
sums of the subdivisions are equal te the areas of the elementary 
figures containing D and the lower sums to the areas of the elemen- 
tary figures contained in D. 

Remark 2. Let the function f (x, y) be integrable in a bounded squar- 
able domain D, let the plane be covered by a square net with spacing h, 
lel Cy, Ca, . «oy Oncyy be the squares of the net contained entirely in D, 
let (&),, m,) be an arbitrary point of a square C;, and m,, = inf f (z, y) 


Cy 
(= 4,2,..., 0 (h)). Phen each of the sums 


has a limit as h +0 equal to | [ { (x, y) dz dy. 


D 

To prove this it suffices to notice that the sums differ from the 
ordinary integral sum (respectively from the lower sum) of a function 
f (z, y) in D only in having no terms corresponding to squares with 
points in common with the boundary TI of D, the sum of all the 
absent terms being less in absolute value than the product of the 
supremum Jf of the function | f (x, y) | in D and the area S$ of the 
elementary figure consisting of the squares having common points 
with TP. By the statement proved above S > (0 ash — 0. 

Regarding the definition we have given, the question naturally 
arises as to whether the fact of the existence of a double integral and 
its value J depends (4) on the choice of coordinate axes Or and Oy 
in the plane; (2) on the choice of a rectangle R on which we define 
the function / (zx, y). 

In the next subsection we shall give another definition of the 
integrability of a function f (z, y) and of the double integral] that 
depends neither on the choice of coordinate axes nor on the choice of 
n rectangle J? and prove the equivalence of the two definitions. 

lor the time being we shall demonstrate the following main theorem 
which is almost immediate from Theorem.2.5 and the above definition. 
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Theorem 2.4. If a function f (x, y) has the I-property ina domain D, 
then it is integrable in D. 

Proof. For such a function f (x, y) the function F (z, y) defined by 
formula (2.2) has the /-property in the rectangle R. 

Indeed, F (z, y) is bounded in R and all discontinuity points and 
lines of the function either coincide with the corresponding discon- 
tinuites of f (x, y) or lie on the boundary IT of D. Since I has zero 
area, the theorem is proved. 

Corollary 1. If a function f(x, y) is bounded in a domain D and has 
discontinuities only on a finite number of rectifiable curves in that do- 
main, then f (x, y) is integrable in D. 

Corollary 2. If f (x, y) is integrable inD and g (zx, y) is bounded and 
coincides with f (x, y) everywhere in D except for the set of points of 
zero area, then g (z, y) is also integrable in D. 

2.1.4. Definition of a double integral by means of arbitrary sub- 
divisions of a domain. In the subsection above we defined the double 
integral on the basis of dividing a domain by straight lines into a 
finite number of subrectangles. Here we shall give another definition 
of the double integral based on dividing a domain D by curves of zero 
area into a finite number of subdomains of arbitrary form and prove 
this definition to be equivalent to that given above. 

Let D be a closed bounded domain with boundary I of zero area. 
Divide D by means of a finite number of arbitrary curves of zero 
area into a finite number r of (not necessarily connected!) closed 
subdomains D,, D., ..., D,. 

Notice that every domain D; is squarable, for its boundary has 
zero area (see Section 11.2 in [4]) and denote by AD; the area of a 
subdomain JD. 

In every subdomain D; choose a point P; (&;, 7;). 

Definition 1. The number 


o =2 f (P1)-AD, (2.3) 


is called the integral sum of a function f (x, y) corresponding to a given 
subdivision of a domain D into subdomains D; and to a given choice of 
intermediate points P; in the subdomains. 

The diameter of a subdomain D, is the supremum of the distances 
between any two points of the subdomain. The symbol A designates 
the largest of the diameters of the subdomains D,, D,, ..., D;. 

Definition 2. A number I is said to be the limit of the integral sums 


(2.3) as A +0 if given any positive number © we can find a positive 


number 6 such that for A <6, regardless of the choice of points P;, 
in subdomains D; 


lo—I|}<e. - 
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Definition 3 (general definition of integrability). 

A function f (x, y) is said to be integrable (in the sense of Riemann) 
in a domain D if there ts a finite limit I of integrable sums o of { (z, v) 
as A +0. The limit J is called a double integral of f (x, y) over D. 

We prove the following fundamental theorem. 

Theorem 2.5, The above general definition of integrability is equiva- 
lent to that given in Section 2.1.8. 

Proof. It is obvious that if a function f(z, y) is integrable accord- 
ing to the general definition of integrability and its double integral 
by that definition is equal to J, then f (x, y) is integrable according to 
the definition of Section 2.1.3 too and has by that definition the same 
double integra] I. 

Jt remains to prove that if f (z, y) is integrab]e in D according to 
the definition of Section 2.1.3 and / is a double integral of f (zx, u) 
over D by that definition, then for the function f (x, y) there is a 


limit of integral sums G equal to J as A +0, 


Donote hy M, and m, the supremum and infimum of a function 
f(r, y) in a subdomain D, and consider the upper and lower sums 


S= 2 A7,-AD, and s= ba 7 -AD;. 
{= 


{2 
Since for any subdivision 
soy, 


it is sufficient to prove that the two sums S and s converge to J 
as A —- 0, 

We wish to prove that given any ¢ > O we can find 6 > 0 such 
that_cach of thesums S and s deviates from J by Jess than c as soon 
as A <6, 

Take an arbitrary e > 0. For this © we can find a subdivision T of 
a rectangle 22 containing the domain D into subrectangles &;, such 
that for that subdivision 


S—s<5. (2.4) 


Denote by AJ, the supremum | f (z, y) | in D and enclose all tho 
segments of the straight lines generating 7 and the boundary I of 
D in an elementary figure whoso area is Jess than e/4//,. 

Then automatically there is a posttive supremum 6 of the distance 
between two points ono of which belongs to the boundary of the 
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elementary figure and the other to the segments of the straight lines 
generating the subdivision 7 or to the boundary I of the domain D*. 

We prove that for the sums S and s of any subdivision of D satisfy- 
ing the condition A < 6 we have 


S<S+5, (2.5) 
s—5<s. (2.6) 


We restrict ourselves to proving inequality (2.5), for the proof of 
(2.6) is similar. _ 

Remove from S all terms /4,;-AD, corresponding to domains D;, 
each failing to be entirely in one subrectangle of the subdivision T. 
All such subdomains D; belong to the above elementary figure, so 
the grand total of the areas of such domains is less than e/4M,. 


The sum of all removed terms M,- AD, is therefore less than e/4. 
Thus, within the error not greater than é/4, 


S= >" AT,-AD,, (2.7) 


where the prime means that the sum is taken only over subdomains D, 
lying entirely in the corresponding rectangles of the subdivision T. 


Now we replace on the right of (2.7) the suprema M, in subdo- 
mains D; contained in a subrectangle RA, by the supremum MM, in 
R,. Then we get 


xy M,-AD,<> M,,-ARp, (2.8) 


where AR, denotes the area of the domain R, equal to the sum of 
all subdomains D, contained entirely in the subrectangle A. 


All subdomains R, — #,, belong to the elementary figure chosen 
above. Consequently 


2 (ARa— Als) <a 


* Indeed, consider two sets: (4) a set {P} of all the points of the elementary 
figure’s boundary and (2) a set {Q} of all the points of the segments of 7 and 
of the boundary IT of D. Both sets {P} and {Q} are bounded and closed. Suppose 
the infimum 6 of the distance p (P, Q) equals zero. Then we can find two sequences. 
of points {P,} and {@,} such that p (P,, Q,) ~ 0. By the Bolzano-Weierstrass 
theorem we may choose convergent subsequences {P,,} and {Q,,} of {P,} and 
{Q,}, Whose limits P and Q (by closure) belong to {P} and {Q} respectively. But 
then p (P, Q) = 0, i.e. the points P and Q coincide, which is impossible, for the 
set {Q} lies strictly inside the elementary figure and has no common points 
with {P}. The contradiction obtained proves that 5 is positive. 
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and therefore 


Ss — Sy AR, |=|5 a (AR,—AR YL <t, 
h. i, 


Thus, within the error not greater than e/4, 


NO, SMTty, = 8. (2.9) 


Comparing equations (2.7) and (2.9), which hold within the error not 
greater than c/4, with (2.8) we obtain inequality (2.5). 

The proof of inequality (2.6) is similar. 

From (2.5) and (2.6) we get 


s— ecseS S45. (2.10) 


Since hy (2.4) cach of the sums s and S deviates from J by less than 


c/2, each of the sumss and 8S, by (2.10), deviates from J by less thane. 
The theorem is proved. 


2.2, BASIC PROPERTIES OF THE DOUBLE INTEGRAL 


The properties of a double integral (and their derivation) aro 
quite similar to the corresponding properties of a single definite in- 
tegra], Therefore we restrict ourselves to formulating these prop- 
erties, 

1°. Additivity. If a function f (x, y) is integrable in a domain D 
and if a curve LF of zero area divides the domain D into two connected 
domains J, and D, having no interior points in common, then 
f(z, y) is integrable in either of the domains D, and D,, with 


e a 


D 
2°, Lincar property. lf functions 7 (2, y) and g (zx, y) are integrable 

in a domain J and @ and § are any real numbers, thon 

la-f(r, yy -- Beg (x, y)l is also integrable in D, with 


| { f(x, y)drdy = \ f(x, w)dxdy +- \ | f(xy wy) dx dy 
Dy Ds 


| \ tere, YW) +BeK (2, y)jdzdy= 

"p 

= Z | | f(z, y) dzdu--fs [ [ £(z, y) az dy. 
"pd “3 


3°, If functions 7 (z, y) and g (z, y) are integrable in a domain D, 
then so is their product. . 
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4°, If f (x, y) and g (x, y) are both integrable in a domain D and 
everywhere in D we have f (z,"y) < g (zx, y), then 


\\ 4@ yydzdy S \ | e@, y) dx dy. 
D 


D 
Do. If f (x, y) is integrable in D, then so is | f (z, y) |, with 


|| Jee wdedy]|<J J ir@, wldedy. 
D D 


(Of course, the integrability of | f (x, y) | in D does not imply that 
of f(z, y) in D.) 

G°. Mean value theorem. If both functions f (x, y) and g (zx, y) are 
integrable in a domain D, the function g (z, y) is nonnegative (non- 
positive) everywhere in D, and M and m are the supremum and in- 
fimum of f (x, y) in D, then we can find a number y satisfying m < 
<u < M and such that we have the formula 


\j f(z, y\g(z, y)dzxdy=p \ \ g(x, y) dxdy. (2.11) 
D D 


In particular, if f (z, y) is continuous in D and D is bound, then 
we can find in D* a point (€, n) such that »p = f (E, n) and formula 
(2.11) becomes 


\Ji@ wee, vdedy=Fee, w | | ele, vdzay. 
D D 


7°. Important geometrical property. \ | 1-dz dy is equal to the 
D 


area of a domain D. (As was already noted above, this property 
follows immediately from the definition of integrability in Section 
2.1.3.) 


2.3. REDUCING A DOUBLE INTEGRAL TO AN ITERATED 
SINGLE INTEGRAL 


Reduction of a double integral to an iterated single integral to be 
discussed in this section is one of the efficient ways of computing 
a double integral. 

2.3.1. The case of the rectangle. 


Theorem 2.6. Let there be a double integral \ | f (x, y) dx dy for 
R 
a function f (x, y) in a rectangle R= lax<z<bj xX [lexty<d). 


* By Theorem 14.5 of [4]. 
5—O3684 
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Also let there be for cach x of the closed intervala Jz < ba sinule 
integral 


T(z)= \ ft, y) ay. (2.12) 


Then there is an iterated integral 


b ob d 
\ f (x) dz= { dz | f(z, y)dy (2.13) 
a a c 
and we have 
a yo 
| | f(z, w)drdu= da lj (xz, 4) dy. 
“ah a c 


Proof. As in Section 2.1, divide the rectangle R hy means of the 
points asxoyoryiyomcm... car, = db and cH=Yycoy< 
<a... <OMp = d into n-p subrectangles 

Rev [Tarr SERA] X (Yer SVS) 

(k= 1, 2 me se ey 5 l= 4, 2, ae67 Pp). 


Set Ax, = 2p — Trey, AY = Yr — Yr-, and denote by Af,y and my) 
the supremum and infimum of f (z, y) on a subrectangle J2,). Mvery- 
where on 72), then 


yt <= j (7, y) <i Mit (2.14) 


Set z = &,, where &, is an arbitrary point in a closed interval 
[x, _,, J, and after that integrate (2.44) for y going from yy_, to yy. 
We eet 

Vy 


Mp Aly <= { f (Es. y) dy My))° Ay. (2.15) 


Viney 


Summing (2.15) over all / from 4 to p and using notation (2.12) 
we have 


‘ “ 
Nom ANS (En) <> Mar-eAye (2.16) 
irs rs 


Next we multiply (2.16) by Az, and sum over al] 4 from 1 to n. 
We eet 

bi Tt 
v ¥ mp pAZy, Au; ay F(E,)-Az, < <= » ‘ Myr Ag): Au. 


id —w7w, aw 


i | jeat inn Cs :4 


(2.17) 
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Let the greatest diameter A of subrectangles tend to zero. Ther 
so does the greatest of the lengths Az,. The framing terms in (2.17), 
the lower and the upper sum, then converge to a double integral 


|) f@ y) de dy, 


R 
There is therefore a limit on the middle term too in (2.17), equal 


to the same double integral. But that limit by the definition of the 
single integral equals 


b b d 
\ Z(@) dt = { dz \ f(z, y) dy. 


This proves the existence of the iterated integral and equation (2.13). 
Thus the theorem is proved. 

Remark. In Theorem 2.6 we may interchange zx and y, i.e. we may 
assume the existence of the double integral and the existence for 
any y in the closed interval c << y <d of the single integra] 

b 
K(y)=\ f(x wae. 
a 
Then the theorem will state the existence of the iterated integra] 

d d b 

| Kway= | dy Vile, wae 

c ¢ a 
and the equation 

dob 
| #(, yazay= J ay | fle y)az. (2.18) 
R C a 

2.0.2. The case of an arbitrary domain. 

Theorem 2.7. Let the following conditions hold: (1) a domain D is 
bounded, closed and such that any straight line parallel to the Oy azis 
intersects the boundary of that domain in at most two points whose 
ordinates are y, (x) and yy (x), wherey, (2) < ye (x) (Fig. 2.4); (2) the 
function f (x, y) allows the existence of a double integral 


|| f@, wWdedy 
D 


and the existence for any x of a single integral 
¥a(X) 
f (=, y) dy. 
¥4(X) 
5h 
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Under these conditions there is an iterated integral 
"3 ve(x) 


\ dx | f(z, uydy 
2 (2) 
(x, and x. being the smallest and largest abscissas of the points of 1) 


and we have 


Xs ve(x) 
|| f@ wardy = | dx \ f(x, yay. (2.19) 
“p x y(t) 


Proof. Denote by /? a rectangle with sides parallel to the coordi- 
nate axesand containing a domain D, and by F (z, y) a function coin- 
ciding with f (z, y) at the points of D and equal to zero at the other 


vf yh 


=(y) @,(y) = 


A Fig. 2.5 


points of #. For the function F (z, y) all hypotheses of Theorem 2.7 
hold in #2, and therefore we have formula (2.13) which (considering 
that F (x, y) is zero outside D and coincides with f (7, y) in D) goes 
over into (2.19). The theorem is proved. 

Remark {. In Theorem 2.7 we may interchange z and y, i.e. we 
may assime that the following two conditions hold: (1) the domain 
D is such that any straight line parallel to the z axis intersects the 
boundary of the domain D in at most two points whose abscissas are 
x, (uv) and xy (y), where 2, (y) < te (y) (Fig. 2.5): (2) f (2, y) allows 
the existence over D of a double integra] and the existence for any y 
of u sinele integral 

x2(¥) 

\ f(z, uw) dr. 

1D) 

Tf the two conditions hold there is an iterated integral 


vs x<2(¥) 
lav \ f(z, waz 


uy xyty) 
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(y, and wy being the smallest and largest ordinates of the points of D) 
and 


va rg(t) 
| \ f(z, y)dxdy = \ dy | f(r. u)dz (2.199) 
vt as) 


Example. Let a demain D be the circle 22 ~~ yx? < R* (Fig. 2.6) 
and f(z, y) = at (RA -— yy. Any straight line parallel to the z 


y A 


Fig. 2.6 Vig. 2.7 


axis intersects the boundary of D in at most two points whose abscis- 
sas are 2, = —VRE~ y? and re VRE —y (seo Fig. 2.6). 
Therefore, applying formula (2.19’) wo oot 


R Vv hese 
{ { f(x, y)dzdy = { dy | r (Re~ yy" dr= 
2B -Ro Lina 
B Vie n 
= ) (Ree P? f \ x* dz | dy =: = { (R2— y2)3 dy = woe Rt. 
—# ~ Vey =f 


Remark 2. In caso a domain D fails to satisfy the requirements of 
Theorem 2.7 or Remark 4 to it, it is often possible to divide that 
domain into a sum of a finite number of domains of this type that 
have no interior points in common. Then the integral over D is, by 
additivity (sce property 4° of Section 2.2), equal to the sum of tho 
integrals over the corresponding domains. Thus it is possible to divide 
the domain D depicted in Fig. 2.7 into the sum of three domains, 
D,, Dea, and Ds, to each of which we can apply either Theorem 2.7 
or Remark 1. 
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2.4, TRIPLE INTEGRALS AND 1!-FOLD MULTIPLE INTEGRALS 


The theory of double integral we have presented can be extended, 
Witheut Introducing any new ideas or complications, to the case of 
the triple integral or to n-jold multiple integral in general. We shall 
discuss (he main points of the theory of n-fold multiple integral. 

First of all. Jet us define the volume of an n-dimensional rectan- 
gular parallelepiped to he equal to the product of the leneths of all 
of its edges emanating from a single vertex. 

Let us further give the name of an elementary body to a set of 
points of an n-dimensional space that is a sum of a finite number of 
n-dimensional rectangular parallelepipeds without common interior 
points and with edges parallel to the coordinate axes. 

The volume of any elementary body is known to us and is equal 
to the sum of the volumes of the constituent parallelepipeds. 

Now let 2 be a bounded domain in n-dimensional Euclidean space. 
The lower volume of D is the supremum V of the volumes of all 


elementary bodies contained in D, and the upper volume of D is the 
infimum V of all elementary bodies containing 7. 

It can easily be seen that V < V*, 

A domain D is said to be cubable if V = VY. The number V = 


= J’ = V js the n-dimensional volume of D., 


The following statement can be proved in complete analogy with 
the case of the plane domain. 

For an n-dimensional domain to be cuhable, it is necessary and siuf- 
ficient that for any positive number e there should be lwo elementary 
bodies, one containing D and the other contained in D, the difference 
between whose volumes in absolute value is less than e. 

Let us agree to apply the term surface (or manifold) of n-dimensional 
volume sero to a closed set al] the points of which lie in an elementary 
hody of arbitrarily small n-dimensional volume. 

It is obvious that an x-dimensional domain D is cubable if and 
only if the boundary of the domain is a manifold of n-dimensional 
volume zero, 

First an n-fold multiple integral of a function of m variables 
f (xy, Te,» « &,,) iS defined in an n-dimensional rectangular paralle- 
lepiped 7? whose edges are parallel to the coordinate axes. 

To this end we divide each of 7 edges of 2 into a finite number of 
sepments to obtain a subdivision 7 of J? into a finite number of 
n-dimensional subparallelepipeds* *. 


* The inequality V < ¥ can be proved in exactly the came way as P< PF 
was in Section 14.2.6 of (Lf. 

7° It may be said that a subdivision 7 is carried out hy incans of a finite 
number of (n -— 1)-dimensional hyperplanes parallel to the coordinate axes, 
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For the subdivision 7 the integral, upper, and lower sums of any 
bounded function f (z,, t, . . ., Z,) are defined in complete analogy 
with the case 2 = 2. 

The n-fold multiple integral of f (z,, 2, . . .,; Z,) over the paralle- 
lepiped FA is defined to be the limit of integral sums as the length of 
the largest of the diagonals of the n-dimensional subparallelepipeds 
tends to zero. 

As for the case n = 2, Darboux’s theory establishes a necessary 
and sufficient condition of integrability in the following form: for a 
function f to be integrable in a parallelepiped R it is necessary and 
sufficient that given any e > 0 there should be a subdivision T of R for 
which the difference between the upper and the lower sum should be less 
than €. 

After that it is easy to define the n-fold multiple integral of f 
over an arbitrary closed bounded n-dimensional domain D with 
boundary of n-dimensional volume zero. 

This integral can be defined as an integral over 1 D-containing 
n-dimensional rectangular parallelepiped R (with edges parallel to 
the coordinate axes) of a function F coinciding with f in D and zero 
outside D. 

It is natural to designate an n-fold multiple integral of a function 
7] (2, fo, .. -, £,) over a domain D by the symbol 


\j see \ fe. Loy eeey Ln) AL, aL. «+. Aq. (2.20) 
D 


To abbreviate notation where no confusion may arise, however, we 
shall denote the integral (2.20) briefly 


\ f (2) de. (2.20") 
D 


In (2.20’), by x wo mean a point t = (2,, Yq, .. -, T,) in HL”, by 
dx a product dx = dz, dx... .dz,* and by \ an 2-fold multiple 


integral over an n-dimensional domain D. 

Just as for the case nm = 2, we can prove integrability over an 
n-dimensional domain D of any function f possessing in D the /-prop- 
erty (i.e. a D-bounded function all of whose discontinuil.” points lie 
in an elementary body of arbitrarily small n-dimensional volume). 
In general no change in an integrable function f on a set of points of 
n-dimensional volume zero would change the value of the integral 
of that function. 

To define the n-fold multiple integral we can use division of a 
domain D into a finite number of subdomains of arbitrary form by 


* This product is usually called an element of volume in E", 
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means of a finite number of arbitrary manifolds of volume cero, 
Repeating the arguments for Theorem 2.5 we prove that such a gen- 
eral definition of the n-fold multiple integral is equivalent to the 
above definition. 

In complete analogy with Theorem 2.6 and Theorem 2.7 we estab- 
lish the iterated integration formula for the integral (2.20). 

Let an n-dimensional damain D,, possess the property that any straight 
line parallel to the azis Ox, intersects its boundary at at most two points 
whose projections onto the axis Ox, are 


Q (Le, Lg) e+ vy Xn) and b (Lay Lqy «© oy Ln); 


where a (Xa, Lg, .- +) In) SY (To, Fg, . . -y Ly). 
Further let the function f (71, Zo... +} %,) allow the existence of an 
n-jfold multiple integral 


{ { = \ fe. Tay veey Ln) AL, dr, ... dLpy 


cs 


n 
and the existence for any Zo, X53, «+. L, of a single integral 
Leeks cee ty) 


f (2, Day ooey Zn) dzy. 


a 
C(Xq. = is Pr oe xy) 


Then there is an (rn — 1)-fold multiple integral 


b(X3, x3, eosvee xy) 


\ ee | dz, day wee aX, [ f (Zp, Loy ooeg Ly) C2, 


v 
Na] a(xg, x3, eeen xn) 


over an (n — 1)-dimensional domain D,,., which is a projection of Dy 
onto the coordinate hyperplane OreXq, . . », Xp, and we have the iterated 
integration formula 


| f a a | \ f (2s, Lay »e+Ln) dx, dz. eee dz, = 
D,, 
U(xa, X30 ee By) 


= | | cee | dz, dzy ... azn, \ f (2, Tay cony Xn) dx, 


Dy; al{xe, XS. ween xy) 


(2.21) 


Of course, in the statement above any of the variables x, ra, ... 
..., 2, may also play the part of 2. 

We shall agree to say that a domain D is simple if every strai¢ht 
line parallel to any coordinate axis either intersects the boundary of 
the domain in at most two points or has an entire sepment on that 
boundary. 
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Given a simple domain we can use the iterated integration for- 
mula for any of the variables x,, 72, ... 

An n-dimensional rectangular ‘arallelepiped (whose edges are 
not necessarily parallel to the coordinate axes) may exemplify a 
simple domain. 

In conclusion we note that properties 1° to 7° formulated in Sec- 
tion 2.2 for the case of the double integral remain valid for the 
n-fold multiple integral. 


In particular \\-. | 1-dxz,dz,....dzx, is equal to the n-di- 


mensional volume V (D) of a domain D. 

In addition, as for the case » = 2, the following statement holds. 

Let f (ty, La). . -, Zp) be a function integrable over a bounded cubable 
domain D. Also let E™ be a space covered by a net of n-dimensional 
cubes with edge ht; let C,, Co, .» +1 Cnqny be those cubes of the net that 
are contained entirely in D; let (&(, E&), E(®)) be an arbitrary 
point of a cube Ons and let my, be the infimum of f in a cube 
Cy (A = 1, 2, ...,  (h)). Then the sums 


n(®) n(h) 
a f (&), ECh), eos] gh) ° h® and 2 Mp bd h” 


have a limit, as h 0, equal to \ \ we a\ f (1, Loy». +) Lp) Au, X 
D 
X dXo... dz. 


2.5. CHANGE OF VARIABLES IN AN r-FOLD MULTIPLE INTEGRAL 


The purpose of the present section is to justify the formula for 
change of variables in an n-fold multiple integral. 

The formula to be established is one of the major tools for com- 
puting an n-fold multiple integral. 

Suppose that a function f (y1,; Yo: «+ «) Yn) allows the existence of 
the n-fold multiple integral 


J ferddy= J) ce VF Yor eee Yn) Bis dite e+. Gin (2.22) 
D D 


over some bounded closed cubable domain D in the space of variables 
Yr Yor » « +» Yn. Further suppose that we change from y;, Yo, - - +1 Yn 
to some new variables z,, 2, .. +, Zn, i.e. make a transformation 


Y4= py (x4, Loy vooy Ln) 


Yo = the (24, Loy eee, Ln) (2.23) 


Yn = Wr (X4, Voy eve; Zn). 
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Jn shorthand, (2.23) wil) be denoted by the symbol 
y= rp (2), 


where x and y are points of the r-dimensional space z= 
= (Ly, Lay ee ey Ta)y UO (yy Yo » oy Un) aNd of is the collection of x 
functions Py, Pa... -, Wp. 

We denote by D’ the domain in the space of z,, 22, .. ., 2, trans- 
forming under (2.23) to D, i.e. set D = wp (D’)*, 

We prove that if the functions (2.23) have continuous partial 
derivatives of the first order in D’ and if the Jacobian 


aa (17) we (uy. Un, ewes Yn) y) 92 
“Gaey ee ee ae ( £4) 
oa (x) wt (Z); Tny oo ay Xn) 


is nonzero in D’, then for the integral (2.22) the following change of 
variables formula is true: 


| sn dy= \ rt ei] aes 


J) 


dz, (2,23) 


In explicit notation (2.25) has the form 


eee | fs Yay woey Yn) dy, dis _ dy, = 


e a e 


= { | eee Ft (ay, eee, Xn), ceey Wy (Zy, oes, En) % 
op 
a (Uyy wees Un) 


v L) (24, eoaceq Zn) 


Thus we shall prove the following main theorem. 

Theorem 2.8. If the transformation (2.23) converts the domain D’ 
into D and is one-to-one and if the functions (2.23) have in D' con- 
finuous partial derivatives of the first order and the nonzero Jacobian 
(2.24)**, then provided the integral (2.22) erists (2.25’) is true. 

The proof of Theorem 2.8 is not elementary. The basic idea behind 
it is that we first justify formula (2.25) for the case where the trans- 
formation (2.23) is linear and then reduce the general transformation 
(2.23) to that case. 

For convenience we divide the proof into several steps. 

Proof of Theorem 2.8. 

1°. Lemma 1. If a transformation z= 1p(z) is a superposition 


dz, 12. dXpy. (2.25) 


* We assume that the transformation (2.23) has the inverse and that D’ = 
=z 37! (D). 

“* Note that when the hypotheses of Theorem 2.8 hold equations (2.23) 
can be solved for z,, Zs, ..., Zn, the inverse transformation x= 17? (y) 
obtained in this way having in D, by Theorem 14.2 of (1],_ continuous 
wo (z) 


partial derivatives of the first order and a nonzero Jacobian Fw’ 
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(or, as it is usually called, a product) of two transformations y= 


= W, (x) and z=, (y), then the Jacobian ae taken at any point 
= (2 i. zx ) is the product of the Jacobian 2 (y) nd 2 (2) 
T= (Ty, Lo, «2+, Lp Pp ; vé Jaco “ 2 () a Dy) 


0 0 0 
taken respectively at x==(%, Zq, ...) Tp) and y=(Y4, Yo) ---1 Yn)s 


where y=, (z), i.e. 


D2) _ Ze), ZW) (2.26) 


D (24) Zo) +++4 Zn) _ JD (23, Zo, «++ Zn) AY 1) Yor e-+5 Yn) (2.26') 


Proof of Lemma {t. The element at the intersection of the ith 
De) is equal to Ori 
D (2) q zp” 
with the partial derivative taken at a point x. By the indirect 
differentiation rule (see Section 14.7 of [1]) 


row and the Ath column of the Jacobian E 


(3 
OZ; dz, 94! 
on =2 ae oa (2.27) 
=1 
all partial derivatives 5 on the right of (2.27) are taken at 


023 


0 
ay, at the corresponding points y= 


0 
x and all partial derivatives 


1 (x). 
Equations (2.27), trueforanyi = 1,2,...,nandé = 1,2,...,n, 
and the theorem on the determinant of a product of two matrices 
(see “Linear Algebra’) lead directly to formula (2.26). 

The proof of Lemma 1 is complete. 

2°, Before stating the next lemma we recall the definition of linear 
transformation of coordinates. 
Linear transformation is a transformation of the form 


YU = By yl y Ayo Lo + oe ef AnZn, 
Yo = gy Xy + Anse oo e + AgnZp, (2 98) 


Un, = On Ly Ano. + eee + annXn; 


where aj, (i = 1,2,..., 3k = 1, 2,..., 2) are arbitrary constant 
numbers. 
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The linear transformation (2.28) will be briefly denoted by y = Tz, 
where xz and y are points 2 = (Z,), Je, .. +, 2m) and y= 
s= (Uy. Vay se ey My) Of & Space E” and 7 is a matrix T = fl az, |! 
(Gix=41, 2, ..., mn; b= 4. 2. ..., n). 

The matrix 7 is usually called the matrix of linear transformation. 

If the determinant of the matrix of linear transformation det 7 
is nonzero, then y = Tris said to be a nonsingular linear transfor- 
mation. For such a cransformation, by the Cramer theorem*, we can 
solve equations (2.28) for z,, ra, .. -; %, and assert the existence of 


the inverse transformation z = 7'-!y which is also linear and non- 
singular. 

Further notice that for the linear transformation (2.28) the 

co 

Jacobian = = coincides with the determinant of the matrix 7 
of the transformation, i. e. 

Tf (u) 

——— =< <det 7. 2.29 

of (x) ( ) 


The main purpose of this and the next two subsections is to prove 
that the change of variable formula (2.25) is valid for an arbitrary 
linear nonsingular transformation (2.28). In view of relation (2.29) 
it suffices to prove that for any linear nonsingular transformation 
yu = Tr we have the formula 


lfidy= | f(T2)[det Pldz (2.30) 
D T-1D 


(provided there is an integral on the left of the formula). 
In the present subsection we shall prove that (2.30) holds for two 
special types of linear transformations: (4) linear transformation 7} 


consisting in multiplying the ith coordinate by a real number 2 = 0, 
with the other coordinates remaining unchanged**, and (2) linear 
transformation 7';, consisting in adding to the ith coordinate the jth 
coordinate, all the other coordinates except the ith coordinate 
remaining unchanged***. 

Lemma 2. If f (u) is integrable in D, then (2.30) holds for either of 
the transformations 7% and 13. 

Proof of emma 2. Denote by # an n-dimensional rectangular 
parallelepiped containing D and by F a function equal to f inD 
and to zero in # — D. It suffices to prove that for cither of the 


Ce nmpannaameied 


* See “Linear Algebra" for the Cramer theorem. 
** In symbols 
(xy, Fae oe ey Zn) mm (The ee oe There Megy Typsy «0 oy In) 
¢e@ In symbols 
(xy. aor o @ +f In) —t (71> © © «8 Zpu1y a -}- X Jy Ti+is eo e9 In). 
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transformations 7* and 7,; we have the formula 


\ Fy)dy= | F(Tz)-[det Tide, (2.31) 
R T“1R 
in which 7 denotes one of the transformations, Ls or 743. 
An elementary calculation shows that 
detTH=A, det T,,=1. (2.32) 
Moreover, it is obvious that if R is a rectangular parallelepiped 
On <SVn S by (kK = 1, 2, ..., nm), then [74] R is a rectangular 
parallelepiped 
Oy < 2, <b, when k <i, 
b; (2.33) 
f<z, a>. 


and [7;,]"“! R is a fortiori a cubable domain 
Ay S72, <b, when k= ly 


On the basis of the iterated integration formula (2.21) 


b;_, Deas 


by, 
[Pw ay= | - \ \ ves f dys eee Wye dyes «6. 
R ay, 


Oi ny Chay an 


b; 
1+ Wm | F Wty v0es Yn) Qyp. (2.35) 


Applying to a single integral with respect to a variable y; the 
change of variable formula y; = Az; for the case of 7* and y; = 
= x, + a2, for the case of 7;; (see Section 10.7 in [1]) we get: 

(a) 

b; 


| F(t) eee Yn) dye = 


a; 
b/d 
| \ F (ys; coos Yt-ts Nx4, Yt+te oeey Yn) A dz, when A>0, 
afr 

=| alr 
| \ P (yg, coos Yass ACty Ytuts +01 Yn) (—A) dz, when <0; 

| oj” 
(2.36) 


a 
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for the case of 7*, and 


{ % 
(b) 


My 
Ets coos Un) di = 
ay 


hywky 
m= { PU. ceee Miegs Tab Ty Upsgy oes Yn) az; (2.37) 
nya Xy 
for T 15. 


Inserting (2.36) into (2.35), applying once again formula (2.21) 
and taking into account the equation y;, = x), with & = i, the form 
(2.33) of 17%]-' R and the first of the equations (2.32) we obtain 
formula (2.31) for the case of 7%. 

Similarly, inserting (2.37) into (2.35), applying the iterated in- 
tegration formula and taking info account the equations y, = 2,, 
with / =: i, the form (2.34) of [7 j;]-! J? and the second of the equations 
(2.382) we obtain formula (2.31) for the case of 7,;. This completes the 
proof of Lemma 2, 

3°, Lemma 3. Any nonsingular linear transformation T can be re- 
presented as a superposition of a finite number of linear transformations 
of the types T) and T13. 

Proof of Lemma 3. We first verify that the linear transformation 7" 
consisting in interchanging some two coordinates can be represented 
as a superposition of six transformations of the types T* and 71y. 
Indeed, let 7’ consist in interchanging the ith and jth coordinates 
(the other coordinates remaining unchanged), Then it is easy to 
verify that* 


Te TPL TPT TET 3. (2.38) 


Now notice that by a finite number of interchanges of two rows 
and two columns a quite arbitrary linear nonsingular transformation 
7 can be reduced to a linear transformation (2.28) with matrix |] ay, {I 
all of whose so-called principal minors are nonzero, i.e. all determi- 
nants 


Ap |e ee. ‘| (k=4, 2, 2.0, 7). (2.39) 


* Indeed, keeping in writing only the ith and jth coordinates we get, on 
performing the chain of transformations (2.38), (zy. 7j) — (zp 4- ty) 2) 
me (mary mye ty) —e (zy yy my) oe (my my Ty) > (my tz) > (xy, 74). 
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It remains to prove that the last linear transformation can be 
represented as a superposition of a finite number of transformations 
of the types 7% and 7j;. 

We shall prove this by induction. 

Since A, = ay, 340, using 721 we obtain (7%, 2%, ..., Zp) > 


—> (015321, Loy eo « 9% Xp). . 
Now suppose that by superposing a finite number of transforma- 
tions of the types Tt and 7';; we have succeeded in reducing the origi- 


nal sequence of coordinates (#1, Zo, -.-, Zn) to the form 
(Qy,%\+ eee 47,2», eoeey Qpnyty eae + Qpntnys, Ch+4s eaey Zn)s (2.40) 
To complete the induction it suffices to prove that by superposing 
a finite number of transformations of the types T* and T;; we can 
reduce the sequence of coordinates (2.40) to the form 
(Qqih pt wee HOt Ceti coer Geile oes Pane Tnsts 
On assly bt oes Opa ntsyTrtis Theor s+ er Tn). 
First for every i for which the element ai(;,4,) is nonzero we per- 


e ° ° a 
form successively a pair of transformations Tiny, Ge) (for 


those i for which aj(;,4,) = 0 the corresponding pair of transforma- 
tions is not performed). Superposition of all the pairs of transforma- 
tions reduces the sequence (2.40) to the form 


(QyyTy oe HO sy Trtir oo ey Bpi®y-b eee b 
+ Dy ntsyUptis Thtt, Tete, veer Lp). (2.42) 


Further notice that since the minor (2.39) is nonzero, so is the 
determinant 


(2.44) 


Aig coe Ayn Ayentt) 


(2.43) 


Any coe App Ahen+is) 

0 e@@¢@ @) 4 
equal to it. 

But then there are real numbers Aj, ...; An, Az, Such that a 


linear combination of the rows of (2.43) with those numbers is equal 
to* 


Qinet)iy oe er Gertphks Tn4+s)r+i)> (2.44) 


This means that if for every j = 4, 2,..., & +1 for which 
A, 3 0 we perform successively a pair of transformations 7,43); Ti} 


* To prove this it suffices to add to the matrix of the determinant (2.43) 
the row (2.44) and to apply the theorem on the principal minor (see “Linear 
Algebra”). tee 
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(for those 7 for which 2, = 0 the corresponding pair of transforma- 
tions is not performed,) then superposition of all the pairs of trans- 
formations made converts the sequence (2.42) into (2.41). This com- 
pletes the induction and the proof of Lemma 3. 

4°, Lemma 4. For an arbitrary linear nonsingular transformation 
(2.28) the change of variables formula (2.30) holds, provided the integral 
on the left of (2.30) exists. 

To prove Lemma 4 it suffices to notice that formula (2.30) holds 
for cach of the transformations of the types 7} and 7,,; (Lemma 2) 


and that an arbitrary linear nonsingular transformation (2.25) 
can be represented as superposition of a finite number of transforma- 
tions of the types 7? and 7; (Lomma 3), superposition of linear 


transformations involving multiplication of the corresponding Jaco- 
bians (Lemma 1). 

Corollary of Lemma 4. If G is an arbitrary cubable domain in E", 
T is an arbitrary nonsingular linear transformation, then the n-di- 
mensional volume V (G) of G and the n-dimensional volume V (TG) 
of the image TG of G are related by 


V (7G) = |det 7}-V (G). (2.45) 


To prove this it is enough to put in (2.30) f=:_1, D = 7G and 
take into account the fact that 7-“!D = G. 

5°. Now we proceed to justify the change of variables formula 
(2.25) for a quite arbitrary transformation y = w (z) satisfying the 
hypotheses of Theorom 2.8. 

It should be emphasized that when the hypotheses of Theorem 2.8 
hold the integrals on the left and on the right of (2.25) both exist, 
so that we are to prove only the oquality of the integrals. 

Let us agree to denote by Jy,(z) the elements of the Jacobi 
matrix Seb (i=, 2,...,m; f=1, 2,...,m) taken ata point z= 
== (24, Za, ee oy Zr). 

The Jacobi matrix itself, |] J;; (z) |] will be denoted by Jy (=). 

Jt is convenient to introduce the norm of a point x = (2;, Fq,.-- 

.» Z,) and the norm of a matrix A = || ayy |] (§ = 1, 2, 2.4, m3 
jwi, 2, ..., 7). 

The norm of a point x = (2,, La, «+ +» Ly) is @ number denoted 
by the symbol || x ||Jand equal to max |2; | 


wf, 0, cea M 
The norm of a matrix A=|la,,|| is @ number denoted by the 
n 
symbol || A|| and equal to max [S lazy ]- 
im{, 2 i 


TP 1, we wee os 
Notice that with such a definition of the norms of a point and of 
n matrix the equation y = Az implies that 


Hy l< All We (2.46) 
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In addition it is easy to verify that for the unit matrix E we have 
| £ i] = 1. 

In this subsection we shall prove the following lemma. 

Lemma 5. If the hypotheses of Theorem 2.8 hold and if C is an ne 
dimensional cube in D', then the n-dimensional volumes of the cube C 
and of its image (C) are related by 


VOp(C))S [max ll Jp (z) I" -V (C). (2.47) 


Proof. Let C be jan n-dimensional cube with centre at a point 


z = (2, Lo, ee 8g Ln) and with edge 2s. Then the cube C can be 
defined by 


Wo—2 \l<s. (2.48) 


By the Taylor formula for a function of n variables 1p; (x) (see 
Section 14.5.3 in [4]) there is a number 6, in 0 < 6, <1 such that 


ip, (x) — by (2) = x Jay (n+ G (2~2)) (xy—2)). 
From this and from relation (2.46) we conclude that 


0 
1 C2) — (2) max Jy (2) + 22H (2.48) 


Setting y = p (x), y = *p (x), we get from (2.49) and (2.48) 
0 
lly—y <s-max ll Jp (Z) Il 


Thus when a point x is changed inside an n-dimensional cube C 
with edge 2s the image of x remains inside an n-dimensional cube whose 
edge is 25+ max || Jy (z) |l. 

C 


x€ 

This leads at once to the cubability of the image wtp (G) of any cub- 
able set G* (in particular, to the cubability of wp (C)) and to inequal- 
ity (2.47). This completes the proof of Lemma 5. 

6°. Lemma 6. Let the hypotheses of Theorem 2.8 hold and let G 
be a cubable subset of D'. Then for an n-dimensional volume of the 


image  (G) of G we have** 


Vp (G)<J [det Jy (2) Ide. (2.50) 
G 


* Indeed, the boundary of any cubable set G is a set of n-dimensional volume 
zero, and such a set, by the statement proved, can be transformed into a set 
whose n-dimensional volume is also zero 

** The fact itself of the cubability of » (G) follows from the statement proved 
in the preceding Iemma. 


6~—O01684 


“n 
tm 
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Proof of Lemma 6. We first prove that for any non-singular linear 
transformation 7 and for any n-dimensional cube C contained in D’ 


Vy (C))<fdet PY -[max |] Te (x) WP-V (C). (2.54) 


By the corollary of Lemma 4, for any cubable set G and for any 
linear transformation 77! 


V(773G) = {det 774|-V(G). 
Thus if G = 1p (C), then? 

V(b (C)) = J det 7 |-V (L~%> (C)). (2.52) 
We evaluate the right-hand side of (2.52) using (2.47) by taking 


(2.47) not for the transformation wt but for the superposition of 
transformations 7 y. Then 


(4 (C))< [det 7] {max tI ras (2) I" V (C). (2.53) 


sonsidering that the Jacobian matrix of a linear transformation 
coincides with the matrix of that transformation we gel by Lemma | 


I gery (2) = TS ¢ (2). 


But this exactly means that inequality (2.53) may be rewritten 
as (2.51). 

This proves inequality (2.01). 

Now, to prove Lemma 6, we cover the space £” with a net of 
n-dimensional cubes with edge k, and let Cy, Co, ..-, Cre be 
those of the cubes that are contained entirely in G, and let G),, denote 
the suin of al} the cubes. 

On choosing in every cube C; a point z; we write for that cube 
inequality (2.51) setting 7 = J» (z,). We get 


V OF (C1) < [det J, (z1)| {max fF (za) 78 Ay Cz) IBV (6). 
*ehy 


Summing the last inequality over al) i from 1 to » (hk) we get 
nly) 


V (yp GiN< > Jdet Jy. (x4)| {max Wie (zy Se (x) PV (En). 
(2.54) 


Since the elements of the Jacobian matrix Jy (z) are continuous 
functions of a point x in the whole of D’ and all the more so in G 
and the product [Jy (r)}-!- Jy (z) is a unit: matrix whose norm is 
equal to unity, we have 


lim max {[ (Js (2) 17! Se (x) [f= 4. 
haf xely 


ee ° 
* We take into account the fact that 7-7-7! = E,so that det 7-det 77? == 1. 
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But then it follows from the statement formulated at the end of 
Section 2.4 that the limit of the entire right-hand side of (2.54) 


as h 0 exists and is \ | det Jy (x) | dz. 


G 
That same statement implies that lim G, = G, so that in the 


h-0 
limit as k 0 we obtain from (2.54) inequality (2.50). The proof 
of Lemma 6 is complete. 

7°. Lemma 7. Let the hypotheses of Theorem 2.8 all hold and sup- 
pose in addition that f (y) isnonnegative in D. Then the change of var- 
iables formula (2.25) is valid. 

Proof. Cover Z£” with a net of n-dimensional cubes with edges hy 
and let C,, Co, ..., Cyc, be those of the cubes that are contained 
entirely in D. Further let G; = - (C;). Writing for every domain G,. 
jnequality (2.50) we have 


viC)y<| \det Jy (x) Ide. (2.55), 
G; 
Now let m; be the infimum of the function f (y) on a cube C; (or 


equivalently the infimum of f [p (z)] in G;). Multiplying both sides 
of (2.55) by m; and summing over all i from 4 to n (h) we have 


n{h) n(h) 
YS mV (C)< Dm, | [det Sy (2) Iaz. (2.56) 


i=1 i=1 G; 


By the statement at the end of Section 2.4. the left-hand side of 
(2.56) has a limit as kh +0 equal to | f (y) dy. Since the sum of all 


D 
domains G; is contained in D'* and f is nonnegative, the right-hand 
side of (2.56) for any h > 0 does not exceed 


\ £0) @)]- [det Jy (2) dz. 

Dp 

Thus we obtain in the limit as h > 0 from (2.56) 

J FO) ady<) Ftp @)- Idet Jy (2) Iez. (2.57) 
D D’ : 


In our arguments above we may interchange D and D‘ and consid- 
er g (x) = f lw (z))-| det Jy (z) | in D’ instead of f (y) in D. Using 


nth) 
* In view of the fact that >) C; is contained in D, D'=vp-1(D), 


i=1 
G,= 7? (Cj). 
Gs 
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Lemma 41 and tho theorem on the determinant of the product of 
two matrices we obtain an inequality that has an opposite sense 


\7 (tp (z)]- [det Jy (z)fdzx \1 (y) dy. (2.58) 


From (2.57) and (2.58) wo obtain the change of variables formula 
(2.23). The proof of Lemma 7 is complete. 

8°. It remains for us to completo the proof of Theorem 2.8, i.o. 
to got rid of the additional requirement on the nonnegativity of the 
function f(y) imposed in Lemma 7. 

Let f (y) be a quite arbitrary function integrable over D and let Jf 
be the supremum of | f(y) | in D*. 

By Lemma 7 the change of variables formula (2.25) is valid for 
either of the nonnegative functions i, (y) = M and f, (y) = M- 
— { (y). 

But then tho linearity of the intogral implies the validity of 
Yormula (2.25) for the differonco f, (y) — fe (vy) = f (y) too. Tho proof 
of Theorom 2.8 is complote. 

Remark 1. Under tho hypothoses of Thoorom 2.8 we may assume 
the Jacobian (2.24) to vanish on some set of points S of n-dimension- 
al volume zero in D’. Indeed, S lies inside an olomentary figure C 
of arbitrary small area, with by what was proved above 


| fe ddy= | fb) @)- [det Fy (2) | dz. (2.59) 


t(D’ -C) p’-C 


Proceeding in formula (2.59) to the limit with respoct to a soquence 
of clomentary figures {C;} whose n-dimensional volume Y (C,) 
tends to zoro we see that (2.25) is valid also for the caso under con- 
sideration. 

Remark 2, Since the integral 


a re (2.60) 
D 


is equal to the n-dimonsional volume V (D) of D, it is natural to call 
dy, dy. ... dy, an element of volume in the Cartesian coordinate 
system Oyo... Y, considered. 

Using the transformations (2.23) we transform from the Cartosian 
coordinates Yy, Yo, .. +, Yn to somo now, genorally curvilinoar, 
coordinatos 2,, Za, ..., Za. Since under such a_ transformation 
{by (2.25)) the integral (2.60) becomes 

es 4 ed 
r=\l.., [ | =i tee anes tn} dr, dt,... dtp, 
D 


“i (T1, Iny eae Tn) 


* Recall that the integrability of { (v) in D implies the boundedness of f {y) 
in D and the existence of the infimum and supremum. 
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it is natural to call 


an element of velume in curvilinear coordinate system 2,. Ze, ..- 
oe @ “¢ ~° 

Therefore the absolute value of the Jacobian is characterized by 
the “expansion” (or “shrinking”) of volume under the transformation 
irom Cartesian coordinates u,, Ye, «.-, Y- to curvilinear coordinates 
<4. — -= v ee ~r* 


Let us compute the element of volume in spherical and cylindrical 


coordinates. 
1°. For the spherical coordinates {in three-dimensional space) 


z=recos ¢sin 8, 
y=rsingsing, (r>0, (xen, CS <c 2a) 
s=rcos6 


the Jacobian hes the form 


cososin§ —rsingsin@ rcosecos8 
— (=, 3, 5) 
au oe +3 a _ e , o e : —_ a Ld 
Fane > singsin@ rcosqgsin8 rsin mcos@/—r sin 8. 
cos@ 0 ~—rsin 8 


Therefore the element of volume equals r sin @ er G8 de. 
2°. For the cylindrical coordinates {in three-space) 


=T COS ©, 
=Trsing, {(r>0, 0<¢<2x) 


ay 1 { 


| 
¥ 


the Jacobian has the form 


cose —rsing 0 
=jsing rcosg Qj=r. 
0 0 f 


<= (=, 2, 5) 
= ir; ©, 5) 


Therefore the volume element is r ar co as. 
In particular, for polar coordinates in the plane the element of 


area is 7rcr cc. 
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3°. In an n-dimensional space spherical coordinates are defined 
by the equations” 


z,=rsin0,sin0. ... sin0,-.,, 
n~ { 

Im =F COS Om-; |] sinO, when m=2, 3, ...,n—], 
he-m 


rn —=rcosO,-1, 


where the spherical radius r and the spherical angles 0,, Oo, .. ») Gn 
vary over the range r>0, 0O<0,< 2n, OS On Sa for m= 
=23,...,2—1. 

We can see that in this case the Jacobian has the form 


n=1f 


wer 
ats (x15 Tay re | Tn) -— ae [I sin'~! 0; . 
PAT, Oy ee ey On—y) hod 


Thus the volume clement in n-dimensional spherical coordinates is 


ivf 
r'—! dr | sin *-1 0, d0;,. 


(eo 


Example 1. Find the volume of a body bounded hy the surface 

(2° + y? + 2°)? = aX, (2.64) 
where a > 0. 

The body is symmetrical with respect to the coordinate planes 
Oyz and Ozz and lies above the plane Oxy. It is sufficient therefore 
to find the volume of the quarter of the body lying in the first octant. 

Transforming to spherical coordinates we reduce equation (2.61) 
to the form 


r= ai/ cos 0. 
Since in the first octant we have 


m n 
V<0< z, 0<7SF, 


in view of the expression for the volume element in spherical coor- 
dinates the desired volume VY is 


nye a/2 a y/ C08 0 
vy=4 {ap | a0 | sin Oar. 
tr ( ( 
ee 


* The reverse formulas expressing n-dimensional spherical coordinates In 
terms of Cartesian ones are of the form 


t-2 . ny r 
we fret wee OTH, SIN Om = cos 0, = 
r= [ - , 
mist Tay 


aybere Pm -_ Vv a7 -{- «ee “t+ Dt m “= 1, 2, o 9 ef | ed 4. 
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Thus 


Y= 


1/2 

on , . tas 

7a f sin 0cos0d0= rae 
0 


Example 2. Find the area of a figure bounded by the curve 


z> yr r y 
_ ae oe a 2 
Peete (2.62) 
(where h >0, k >0, a>0, b> 0). 

To find this area if is convenient io transform to the so-called 


generalized polar coordinates 
xZ= ar COS ff, 0<0<? 

ae Zit e 

| y=brsing (O<P<2n) 


Equation (2.62) takes the form 


=— cos (p + 2 sin (p. (2.63) 
Since the left-hand side of (2.63) is nonnegative one should take only 
those values of o for which the right-hand side of (2.63) is nonnega- 
tive. 

On multiplying and dividing the right-hand side of (2.63) by 


V +4 and determining ¢, from the relations 


afh b/k 


SIn Gy =o, )— COS ' 7p = ra 


a b2 
V ete V te 
we reduce (2.63) to the form 
r= V/ 44 sin(g +o). (2.63") 


From the nonnegativity condition of the right-hand side of (2.63) 
we find that OC mo + GM <Q, ic. —P)p9 KM PRA — J. ° 


Considering that the Jacobian Z(t, vy) - = 


the desired area S the following expression: 


V/ S45 sin(p+¢r) 


fs 


equals abr, we obtain for 


a~fo 


S = \ de { abr dr = 
0 


‘A~Go 
= Sate } { Sin (P+ Go) dp= 4 y= of pr fe 


~ Ty) 
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We remark in conclusion that in computing a series of areas it is 
convenient to use a somewhat more general form of generalized 
polar coordinates 


z=arcos® q, 
y = br sin™ q. 


It is easy to see that for these coordinates 
A(z, y) 


————- = abr cos=-! m sin®-!p, 
Zr) ? P 


SUPPLEMENT 


ON THE APPROXIMATE CALCULATION OF 7-Fold 
MULTIPLE INTEGRALS 


We shall discuss the question of approximating the n-fold multiple integra} 


[{ .. f (fy) Loy vey 2n) dz, dz, v2. dzp (2.65) 


G,, 
over some domain G, of £", assuming first this domain to be an n-dimensional 
cube. 

Supposing the integral (2.64) exists we consider the question of optimal 
ways of numerical integration. 

The question bas two aspects: (1) constructing numerical integration for- 
mulas optimal for given classes of functions; (2) constructing numerical integration 
formulas optimal for each particular function of a given class, 

Let us consider cach of these aspects separately. 

28.1. Formulas for numerical] integration that are optimal for classes of 
functions. Let G, be a unit n-dimensional cube 0 < 2x, <1, k = 1, 2,..., 7. 


We shall say that f (z,,...,Z,) belongsin G,, to the class DZ (Af) (respectively 


to the class H¥ (3f)) if, provided all the derivatives appearing below exist, we 
have the inequalities 


<M, 


n 
B= > pan, Aan 


(respectively a, < a). . 
We shall use the term cubature formula for an expression of the form 


a 


I... \ f(2q, coe En) G2, we dzn=ly (NER Ih (2.65) 
“ G 


n 
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where 


N 
In (f) = SY) Crf (™, ..., 20). 
k=1 


The points (z{"), ..«, 24) will be called nodes and the numbers C, are the 
weights of a given cubature formula, Ry (f, ly) being its error. 

Our aim is to construct cubature formulas with error estimation accurate in 
order with respect to an infinitesimal 1/N, where N is the number of nodes in the 
cubature formula. 

N.S. Bakhvalov showed* that neither for classes DF (4) nor for classes 
H& (M) one can construct a cubature formula (2.65) with error estimation 


Ry (f, ly) better than C (a, n)-Mf-N~%, where C (a, n) is some constant depend- 
ing on @ and n. 

For classes H% (M) the indicated estimation is obtained (in order with respect 
to 1/N) if we take as 1,, the product of one-dimensional quadrature formulas. 
accurate for algebraic polynomials of degree an — 1. ; 

Assuming that the number of nodes NV is N = m™, where m is an integer, we 
may put 


m 
D Cny -++ Cnt Shy -++1 Zn) ‘ (2.66) 
k=l 


where {zp,, Cr}, V = 1, 2, «.-,m, are the nodes and weights of a one-dimen- 
sional quadrature formula accurate for algebraic polynomials**. __ 

For the error of the cubature formula with 1, defined by equation (2.66) 
we have an asymptotic estimation (i.e., an estimation correct for sufficiently large 
values of NV) 

Ci, (a, n) M 
Ry (f, ly) © Aen ‘ (2.67) 


where C, (a, n) is some constant depending on @ and nr. 


For classes H% (M) there is also a cubature formula close in order of magnitude 
of error to an optimal one. Such a formula is the number-theoretic formula of 
N.M. Korobov***. 


N 
tw $B [se (GE)e (HF) (GP) 8 4), 
(2.68) 


where a;,...-; @, are integers, the so-called optimal coefficients modulo N, and 
To, (x) are some special polynomials of degree « -+ 1. For the error of the cubature 
formula with 1, defined by equation (2.68) we have 


[Rn (fs Lw)l <aene In? N (2.69) 


* N.S. Bakhvalov. On approximating multiple integrals. Vestnik MGU, 
seria matematiki, fiziki, astronomii, No. 4 (1959), pp. 3-18. 
** Such are, for example, the so-called Gauss formula or the Newton-Cotes 
formula (see, for example, “Methods of Computation” by I.S. Beryozin and 
.P Zhidkov). 
N een NM. Korobov. Number-theoretic methods in approximate analysis. 
Moscow, Fizmatgiz, 1963. 
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(C. (a, n) and B are constants depending only on @ and n). The estimation (2.65 
difiers from the estimation best in order only by the multiplier inky, 
Thus there are sufficiently good cubature formulas for each of the class 


DS. (MV) and HS (AM). 

Of course, in practical applications of these formulas one should consider 
their advantages and drawbacks emerging in porticwar situations. The it 
should be remembered that in computing integrals with the aid of farmu)b 
(2.66) the number of nodes N is not arbitrary but is equal to m®™. For example, 
for n:= 10 and the function f (r,, .-.. 2,) behaving roughly “equally” in al} 
directions a reasonable number of nodes will be N == 2 =: 1024. When a greater 
accuracy is desirable, one can take the number of nodes equal to Ny = 3! = 
—~ 59,049, but this will result in an almost 60-fold increase in computational 
work. 

It should also be taken into account that for a “small” or “intermediate” 
number of nodes N the error of a cubature formula obtained using (2.66) may 
greatly differ from the right-hand side of (2.67)*. 

On the other hand, it is more advantageous to use formula (2.66) when com- 
puting large series of integrals or integrals of functions containing: oxpressions 
depending on a smaller number of variables than n. 

Cubature formulas obtained using (2.68) are free of the disadvantage relating 
to the choice of the number of nodes NV. tis appropriate to use these formulas 
for insufficiently smooth functions f and for a Jarge value of the number of 
variables n (beginning with n = 10). It should be noted, however, that for the 
error of a cubature formula obtained using (2.68) one cannot point out a prin- 
cipal term similar to the one on the right-hand side of (2.67). This makes difficult 
both estimating errors in making computations and predicting the number of 
nodes N required for a piven accuracy to be obtained. 

28.2. On formulas for numerical integration optimal for every particular 
function. We note at the outset that the matter is involved and that little 
detailed work has been done on it. 

To begin with, we revise the statement of the question under study. Suppose 
that a given function f (7), 72, .. ., 2) belongs to some class A, and that we 
are given a set of methods of numerical integration {py} of that function f. 
We shall seek in the given set a numerical integration method py such that He 
error Ry (f, p>) is the infimum of errors Ry (f, py) of the set {py} of the given 
function f. 

In other words, we are seeking the best cubature formula for a given pare 
ticular function {, not for the whole class 4, to which that function belangs**. 

Take as a class A, a set of functions infinitely differentiable everywhere in 
an entire cube G,, except possibly some surface § of dimension & <n on which 
those functions may go into infinity as {/rf,, where r,, is the distance between 
a point x =5 (x, ..., 27,) and a point on the surface vy = (yy, 2. 6) Wy), and 
sank — 1. 

The set {py} is defined as follows, 

For every cubature formula o,, exact for algebraic polynomials of degree 
m — 1 we define an clement py of {py} as a cubature formula obtained by 


* Thus when using for (2.66) the Newton-Cotes quadrature formula thejright- 
hand side of (2.67) is close to its left-hand side bopinning with N == (a,)" (for 
instance, for @ = 1 and a == 10, beginning with NV == 10"), while when the 
Gauss formula is used for (2.66) the right-hand sid. of (2.67) is close to the left- 
hand side heginning with NV = (an/2)" (i.e. beginning with Nw 10° for a -- f 
and a oo: 10). Thus in constricting cubature formulas with ly defined by (2.66) 
the Gauss formula is preferred to the Noewton-Cotes formula. 

*¢ A formula best fora class of functions is roughly speaking best for the 
“worst” function of that class, 
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dividing the entire cube G,, into rectangular parallelepipeds and using for every 
such parallelepiped a formula a,, with the proviso that the total number of nodes 
in the entire cube G, should be N. 

It ys natural to expect the nodes of the cubature formula obtained in this 
way to be optimally allocated provided the error for each parallelepiped is 
constant. 

At the computing centre of Moscow State University routines were compiled 
for computing double and triple integrals that realized automatic subdivision 
of domains of integration. They were based on a pair of cubature formulas g,, 
and 0,,, for m, > m. 


The number p = | 0,, — Gm, {| was taken as the error estimation of a for- 


mula G,,. 

If e is a given computational accuracy, then when p < e (for an entire cube 
G,) we take as an approximate value of the integral the one defined by the for- 
mula ¢,,,, and when p > e the cube is divided into 2" parts and the process is 


repeated all over again for each of the parts. 

This method gives good results for computing double and triple integrals. 
As the number of measurements n increases, however, the application of the 
method runs into substantial difficulties arising from the fact that with m and 
m, fixed increasing n drastically increases the complexity of o,, and o,,, and 


with m and m, decreased increasing n drastically increases the number of sub- 
ivisions. 

In conclusion we note that when computing an n-fold multiple integral not 
over an n-dimensional cube G,, but over an arbitrary domain in E™ one should 
first make a transformation converting the domain into an n-dimensional cube. 
Besides, there are cubature formulas for some domains of special form (the ball, 
sphere, etc.)*. 

28.3. Example of approximate calculation of a muJtiple integra}. Consider 
computing the fourfold integral 


R L 22 21 
F(R, L, i= | rdr { p dp \ dp \ [H+ p?-+12—~2pr cos (@—sp)}-3/2 do 
0 0 0 0 


to some accuracy e for the values of parameters 
R = 1; 1.25; 1.53 1.73; 2; 2.25; 2.5; 2.75; 3; L = 0.8; H = 1. 
A change of variables that maps the domain of integration into a unit cube 
reduces the integra] to the form 
4 


141 f 
F(R, L, H)=(2n)t-R2-L? | \ \ | (rep Lap rere 
° 0 0 0 & 
—2RLpr cos 2a (p—)J-3/* rp dp dr dt dq. 


The integrand is smooth. It is natural therefore to nse in computing the 

integral a cubature formula based on (2.66). For each of the variables r and p 

it is natural to take the Gauss formula (a one-dimensional formula exact for 

algebraic polynomials), while for the variables @ and % it is better to take the 

trapezoidal formula (see Chapter 12 in [1]), for the integrand is periodic in each 

of these variables, and for periodic functions the trapezoidal formula gives the 
est results. 


* Thus cubature formulas for the sphere have been studied in the works of 
S.L. Sobolev and his disciples. 
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Thus we get 
F(R, L, Wy= 


m2 my Ms 


5 1 
=(42) > yY yy yy Cy Cash The X 


kyol keet ks=t hyot 
>. 4 H24+-L?r¢ + Riz? —?ERr xz, cos Os kg—ky =3/2 
: hs hy ky kg m - 


here, (zn.)s (C,,,) are the nodes and weights of the corresponding quadrature foz- 
roula). 


To choose the values of m, m,, and m, guaranteeing a required accuracy 
debugging calculations are made, successively increasing the number of nodes 
and comparing the results obtained. 


CHAPTER 3 


IMPROPER INTEGRALS 


The concepts of (single and multiple) integral introduced earlier 
are not suitable for the unbounded domain of integration or for the 
unbounded integrand. 

In this chapter we shall show how to extend the notion of integral 
to these two cases. 


3.4. IMPROPER INTEGRALS OF THE FIRST KIND 
(ONE-DIMENSIONAL CASE} 


In this section we shall extend the concept of definite integral to 
the one-dimensional unbounded connected domain of integration. 

3.4.4. The improper integral of the first kind. One-dimensional 
unbounded connected domains are the half-lines a<iz< +00, 
—oo<a< band the infinite straight line —co <2 < +00. Con- 
sider for definiteness @ <2 <l +0. 

Throughout this chapter, without specifying it in what follows, 
we shall assume that the function f (xz) is defined on the half-line 
a@<ax< +00 and that for any R >a there is a definite integral 
R 


{ f (x) dz which we denote by the symbol F (R): 


° R 
F(R) = \ f (x) de. (3.4) 


Thus under our hypotheses on the half-linea < R < +00 a function 

F(R) defined by relation (3.1) is given. We look at the limiting 

value of # (R) as R —-+- 00, i.e. examine the existence of the limit 
R 

lim \ f (x) de. (3.2) 
R++00 5 


For the expression (3.2) we shall use the symbol 


\ j (x) de. (3.3) 
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In what follows (3.3) will be called improper integral of the first kind 
of a function f (7) over the half-line a <r < +o. 

If the limit (3.2) exists, then the impreper integral (3.3) is said 
to be convergent. If, however, the limit does not exist, the improper 
integral (3.3) is said to be divergent, 

Remark {. Consider the improper integral (3.3). [f 6 > a, then 


along with this integral we may consider the integral f (x) dz. 


Bo) 
Obviously, the convergence of one of these integrals implies the 
convergence of the other. The following equation holds: 


t 


\ f({z)dr= \ } (z) dx -- \ ] (x) dz. 
| a b 


Note that the divergence of one of the improper integrals implies the 
divergence of the other. 

Remark 2. Jf the improper jntegra] (3.3) converges, the value of 
the limit (3.2) is denoted by the same symbo)] (3.3). Thus, if the 
integral (3.3) converges, we use the equation 


R 
f(z)dzr~— Jim { f (x) dz. 
Rwsow + 


Qa 


2 fan, 


h 
Remark 3. The definition of the improper integrals { } (xz) dr 


and \ f (x) dx is similar to that of the improper integral (3.3). 


~ oO 
The first of them symbolizes the operation of proceeding to the Jimit 


b fiw 

lim [ {/ (x) dx, and the second symbolizes lim f f (x) dz. 
H-e-ose R’iw—co + 
Et Ll" -- + ay iad 


Example. Consider on the half-line a<z< oo {a>O0) the 
function f(r) = f/z?, p = const. It is integrable on any interval 
axzzrcih, with 


zi~p [R Ri-p—a-p 
R it = when pl 
dr J ip Is ip 
zP noo. OR 
a Inz = in — when p=14. 
a 
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Obviously, when p> 1 the limit lim| exists and is equal to 
R-co 
a 
1 
i and when p<xci1 it does not exist. Consequently, the improper 


Oo 


integral \= converges when p> 1 and diverges when p<ci. Note 


that when p>1 


\ dx ai-p 


“zp p—1- 


a 
3.1.2. The Cauchy criterion for convergence of the improper in- 
tegral of the first kind. Sufficient tests for convergence. The question 
of the convergence of the improper integral of the first kind is equiv- 
alent to the question of the existence of the limiting value of the 
R 


function F(R) = \ f (xz) de as R +-+oo. As is known", for the 


limiting value of F (R) as R ~ co to exist, it is necessary and suffi- 
cient that /(R) should satisfy the following Cauchy condition: 
given any © > 0 we can find A > 0 such that for any R’ and R” 
greater than A 


R” 
| F(R) — F(R’) |=|\ f(2yael<e. 
R’ 


The following statement is thus true. 

Theorem 3.1 (Cauchy criterion for convergence of the improper 
integral of the first kind). For an improper integral (3.3) to converge 
it is necessary and sufficient that given any « > O we should find A > 0 
such that for any R’ and R" greater than A 


R 
| ) f (2) dz|<e. 
R’ 


Remark. Note that the convergence of an improper integral does 
not imply even the boundedness of the integrand. For instance, the 


integral { f (z) dz, where the function is zero for nonintegral x 


0 
and equals m for x =n (an integer), clearly converges although 
the integrand is not bounded. 
Since the Cauchy criterion is inconvenient for practical] applica- 
tions, it is appropriate to show the various sufficient tests for the 
convergence of improper integrals. 


* See Section 8.1 of [4]. 


cA 
oO 
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In what follows we shall assume that the function f (z) is on 
@xz< oand that for any R > a there exists an ordinary inte- 


A 
eral } (z) dz. 


a 
We prove the following theorem. 
Theorem 3.2 (general comparison test). Let on the half-line 
a=r<oo 


If (z)| Se (2). (3.4) 
Then the convergence of \ g (x) dz implies the convergence of (yf (x) dz. 
a ¢ 


Proof. Let \ g(x) dx converge. Then, by the Cauchy criterion 


{see Theorem 3.1), given any € > 0 wecan find A > 0 such that for 
any f?’ >A and R” >A 


ne 
| | ¢(epaz| <ie. (3.5) 
R’ 


According to the ordinary inequalities for integrals and to inequality 
(3.4) we have 
R° R? n° 
| \ f(z) dz|< { [f(z)ldz< | g(z) dz. 
R’ R re 


 d 


From this and from (3.5) it follows that for any R° and #2” greater 


than A 


| \ f (x)dz|<e. 
hv 


oo 


Consequently, ( f {z) dz converges 


Theorem 3.3 (particular comparison test). Let on the half-line 
O<<lalz< ow the function jf (x) satisfy the relation 


¢ 
[fDisz. 
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where c and p are constants, p> 1. Then \ f (x) dx converges. Jf, 
however, there is a constant c > 0 such that on 0 << a<x< o we 
have f (x) > ~ , where p <1, then \ f (x) dx diverges. 

9 


The statement of this theorem follows from Theorem 3.2 and the 

example of the preceding subsection (it suffices to put g (x) = c/zx”), 

Corollary (particular comparison test in the limiting form). 

If when p> 1 there is a finite limiting value lim |f (x) |x? =e, 
X~> +00 


then \ f (x) dx converges. But tf when p < 1 there is a positive limiting 
a 


value lim f(x) 2? =c>0, then \ f (zx) dx diverges. 


a 
We shall show the validity of the first par! of the corollary. To do 
this notice that the existence of a limit as z —»~-+- oo implies the 
boundedness of the function z” | f (x) |, i.e. given some constant 
Co >> 0 we have 


{ f (x) [<eo/z?. 


After that we apply the first part of Theorem 3.3. The validity of 
the second part of the corollary follows from the following argu- 
ments. Since c > 0, we can find ¢ > 0 so small that e—e> 0. 
Corresponding to this e is A > 0 such that when z >A we have 
e—e<f(z)x? (this inequality follows from the definition of the 
limit). Therefore f (x) > — and in this case the second part of 
Theorem 3.3 comes into play. 

3.1.3. Absolute and conditional convergence of improper integrals. 
We introduce the concepts of absolute and conditional convergence of 
improper integrals. Let f (x) be integrable over any closed inter- 


val [a, R}*. 


Definition 1, An improper integral \ f (x) dz is said to be abso- 


lutely convergent if the integral | [ f (x) | dx converges. 
a 


* Then so is | f (z) ]. 
7—01684 
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Definition 2. An improper integral \ f (x) dr is said to be condi- 


a‘ 2 


a 


fionally convergent if if converges and | | { (x) | dx diverges. 
a 

Remark. On putting g (z) = [ f(z) [ in Theorem 3.2 we see that 
the absolute convergence of an improper integral implies its conver- 
gence. 

Note that Theorems 3.2 and 3.3 establish only absolute convergence 
of the improper integrals under study. 

Here is another sufficient test for convergence of improper integrals, 
suitable in the case of conditional convergence. 

Theorem 3.4 (Abel-Dirichlet test). Let f{ (2) and g (x) be functions 
defined on asx <i oo. Also let f (x) be continuous on am<ar< 90 
and have on it a bounded antiderivative F (z)*. 

Suppose further that g (x) not increasing monotonically on amr < 
<< 00 fends to zero as x —» -4- co and has a derivative g' (x) continuous 
on d= x <t oo, Under these conditions the improper integral 


{ f(z) u{2z) dz (3,1) 


conver pes. 
Proof. We use the Cauchy criterion for convergence of improper 
Re 


integrals. As a preliminary we integrate by parts f (z) g (x) dz 


ie 
on an arbitrary interval [2', R°], R° > R’, of the half-line a< 
<2< oo. We pet 


ne R° 
ad R” os 
( f(x) g(x) dz= F (2) g(x) i — { P(x) g! (x)ae. (3.7) 
ie ie 

Under the hypotheses of the theorem /* (z) is bounded: | F (27) |< 
< K. Since g (z) is not increasing and tends to zero as xz —> -+ 00, 
we have g (z) > 0 and g’ (zt) <0. Thus, evaluating relation (3.7) 
we obtain the following inequality: 


Re 
r H(z) e (2) dz SK fe (RY) k (NK (— g° (z)) dz. 
it’ 


* This means that F (r), which may be defined to be ( } (t) dt, satishes for 


rf 
all x S: a the inequality [ F (r)} << AK, where AK is a constant. 
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Since the integral on the right of this is g (R’) ~ g (R"), obviously 
R" 

| \ fe) g (a)dz|<2Ke(R’). (3.8) 
pe 


Using this inequality it is not hard to complete the proof of the 
theorem. Let « be a positive number. Since g (z) +Qas 7 > +- 0, 
for this « we can choose A so that when R’ > A we have ge(R)y< 
<e/2K. From this and from (3.8) it follows that for any R’ and R’” 
greater than A 


I yer e() dz\|<is 
fs 


which, by the Cauchy criterion, guarantees the convergence of (3.6), 
The proof of the theorem is complete. 

Remark. The requirement on the differentiability of ¢g (x) in 
Theorem 3.4 is superfluous. Theorem 3.4 can be proved by assuming 
only that g (z) is monotone and tends to zero as z — + co: to do this 
we should use the second mean value formula (Bonnet formula). 

Example 1. Consider the integral 

\ Faz, a>0. (3.9) 


Setting f (x) = sin 2, g (x) = 1/r°, it is easy to see that the hypothe- 
ses of Theorem 3.4 al) hold for this integral. Therefore the integral 
(3.9) converges. 


Example 2. Consider the Fresnel* integral \ sin x* dz. According 


0 
to Remark 1 in Section 2.1.4 convergence of one of the integrals 


fae) 


co 
{ sin z* dx and \ sin z* dx implies the convergence of the other. 


0 f 

We therefore turn to the second of the integrals. We have 
a fo ©) { 
\ sinatdz = | zsin x* — az. 
1 { 


Setting f (x) = x sin z* and g (x) = 1/x we can easily see that the 
hypotheses of Theorem 3.4 al] hold and therefore the Fresnel integral 
converges. 


* Augustin Fresnel] (1788-1827) is an outstanding French physicist. 


7? 
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3.1.4. Change of variables under the improper integral and the 
formula for integration by parts. Now we shall formulate the candt- 
tions under which the formulas for change of variables and for inte- 
gration by parts for improper integrals of the first kind are valid. 
Consider first the question of change of variable under the improper 
integral. 

We shall assume the following conditions to hold: 

(1) f (x) ic continuous on the semiaxris a <0 << 09} 

(Q)acxir<t wisa set of values of some strictly monotone function 
z= g(t) given on a<it<io (or ~wo laa) and having on 
it a continuous derivative: 

(3) g(a) =a. 

Under these conditions convergence of one of the following tmapraper 
integrals 


| | f(z) dx and { fle(t))e' (at ( or — | f(g (t)) 2’ (1) de] (3.10) 


implies the convergence of the other and the equality of the integrals. 

This statement is established using the following arguments. 

Consider a closed interval fa, /?}. Corresponding to it by the strict 
monotonicity of g (f) is an interval [a, 0) (or [p, al) of the axis ¢ 
such that changing f on fa, po] results in the values of x = g (f) 
Alling fa, Rj, with g (p) = AR. Thus for those intervals all the condi- 
tions of Section 10.7.3 of [4] hold under which the formula for chanre 
of variable under the definite infegral is valid. Therefore 


R Qn a 
| { (2) dz =: fie(t)) 2’ (dt ( or = \/ (e (t)) 2’ (t)dt). (3.14) 
‘ a P 


By the strict monotonicity of z = g(t), R — co as p — oo, and 
conversely p — coas 2 — co (or 2} + co as p — —coand p — — oo 
as 22 ~—- oo). Therefore formula (5.41) implies the validity of the 
above statement, 

Now weiproceed to discuss the question of integration by parts of 
improper integrals of the first kind. 

We prove the following statement. 

Let functions u (x) and v (z) have continuous derivatives ona <axnr< 
<i oo, and, in addition, let there exrist a limiting value 

dimou (z)ju(z)-= A. 


Newt 
Under these conditions convergence af one af the intezrals 
ac ox 


| u(x) eu" (x) dr or \ u (ryt (x) dz (3.12) 


. 
3} 


+ 
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implies the convergence of the other. Also 


| w(x) v' (2) dz= A—u (a) uv (a)-- \ v (x) wu! (x) dz. (3.43) 


a a 


To prove the statement consider a closed interva] fa, 2}, The usual 
formula for integration hy parts is valid on il. Therefore 

R R 
\ u(x) v' (x) dx = [wu (x) uv (x)? ~{ v (x) u! (x) da. 


Qa 


Since as R — co the expression [uw (x) v (x)JF tends to A — u (a) v (a), 
the last equation implies simultaneous convergence or divergence 
of the integrals (3.42) and the validity of formula (3.13) in the case 
where one of the integrals (3.42) converges. 


3.2. IMPROPER JNTEGRALS OF THE SECOND KIND 
{ONE-DIMENSIONAL CASE) 


In this section we extend the concept of definite integral to the 

case of unbounded functions. 

3.2.4. The improper integral of the second kind. The Cauchy criterion. 
Let f (t) be a function defined on a half-open interval fa, b). The 
point 6 is said to be singular if f (x) is not bounded on la, b) but is 
bounded on any closed interval [a, b — a] contained in [a, b). Tt 
is also assumed that f (x) is integrable on any such interval. 

Under our hypotheses, on (0, b — a] the function of the argument 
a is given defined by the relation 

b-a 
F(a)= { | (xv) dz. 
a 
We investigate the question of the right-hand limiting value of 
F (a) at the point @ = 0, i.e. question of the existence of a limit 
bo 


lim | f (x) dz. (3.14) 
a++0 ¢ 


For the expression (3.14) wo shall use the symbol 
b 


| f(z) de, (3,45) 


In what follows (3.15) will be called an improper integral of the 
second kind of a function f (xz) on a half-open interval fa, b). If 
there exists a limit (3.44), then the improper integral (3.15) is said 
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to be converzent. But if that limit does not exist, then the improper 
integral (3.15) is said to he divergent. Tf the improper integral (4. 19} 
converges, the value of the limit (3.14) is denoted by the same symbe] 
(3.15). Thus, in case (3.415) converges we use the equation 

9) 


trae 
\ f(zdr= lim | i (x) dz. 
t—~O f 

Remark. The notion of improper integral of the second kind is casy 
to extend to the case where f (x) has a finite number of singular 
points. 

Example. Consider on fa, b) a function 1s(b — xz). p> 0. Tt is 
clear that & is a singular point for that function. Lt is also obvious 
that the function is integrable on any closed interval la, & — al, 
with 


h— zi? ha i —njin-p —_ ain} . 
bate — ea yr a TP A when psf, 
dz I—p fa 1 7 
. b--7)h ~~ +o L—-a 
t — In (h—z) = Jn when pot. 
G 
UV—z 


(u—eprh 


Clearly the limit lim 1—/; 


Roth 


dx oe. . 
\ Goa exists and is equal to 
a 
when p<ci and does not exist when p>{. Consequently, the 
improper integral in question converges when p<01 and diverges 
when pi=i. 

We now state the Cauchy criterion for convergence of the improper 
integral of the second kind. We shal] assume that f (x) is given on 
la. b) and that 4 is a singular point of the function. 

Theorem 3.5 (Cauchy criterion). For the improper integral of the 
second kind (3.15) to converge it is necessaru and sufficient that civen 
any & > Owe should be able to find 6 > U such that for any a’ and a’ 
satisfying the condition O<la° <a’ <6 


tua? 


| | 


ba’ 


j(z)dz|<e. 


The validity of the theorem follows from the fact that the notion 
of convergence of an integral is by definition equivalent to the notion 
of the existence of a limiting value of J (a) introduced at the begin- 
ning of this snbsection. 

3.2.2. Concluding remarks. We shall not expound the theory of 
improper integrals of the second kind. This is because the main 
conclusions and theorems of the preceding section may be carried 
over without difficulty te the case of integrals of the second hind. 
Therefore we shalf confine ourselves to same remarks. 
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4°. Under some constraints on integrands integrals of the second 
kind reduce to integrals of the first kind. Namely, let f(z) be contin- 
uous on [a, b) and let b be a singular point of the function. Under 


these conditions we can make the following change of variables in 
ba 


\ f (x) de: 
4 dt | 4 

t=ab——, dta-e, Gog Stsy 
As a result we get 

b-a i/a ‘ ‘ 

\ f(jde= | f(b—+—)pdt. (3.46) 
a 1/(b~a) 
b 


Let \7 (x)dx converge. This means that there exists a limit 
a 


O-& 
lim \ jf (z)dx. Turning to equation (3.16) we see that there is 
a-+0 


also a limit as 1/a-»-+ oo of the expression on the right of (3.16). 
This proves that the improper integral of the first kind 


(v4) Lar 
1/(b—a) 


b 
converges and is equal to \ f (x) dz. Convergence of the above impro- 


b 
per integral of the first kind clearly implies convergence of \ f (x) dx 


a 
and equality of the two integrals. So convergence of one of the integrals 


[1 (a) ae or ( f(b—-+) at 
a 1/(b-a) 


implies convergence of the other and equality of the integrals. 

2°, For improper integrals of the second kind it is easy to prove 
statements similar to those of Section 3.1.2 which may be combined 
under the general heading “comparison tests”. Note that in all formu- 
lations f (z) is considered on [a, b), where b is a singular point of 
the function. 

A particular comparison test will have the following form. 

If | f (@) | <e (b — zx)’, where p < 1, then the improper integral 
(3.15) converges. If, however, f (z) Sc (b — z)-?, where c > O and 
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p => 1, then the improper integral (3.15) diverges. The proof follows 
from the general comparison test and the example considered in the 
preceding subsection. 

In analogy with Section 3.1.3 we could formulate for improper 
integrals rules of integration by change of variable and of integra- 
tion by parts. 


3.3. THE PRINCIPAL VALUE OF AN IMPROPER INTEGRAL 


Definition. Let f (z) be a function defined on a straight line — co < 
<2 <0 oo and integrable on every segment of that line. We shall say 
that f (x) is integrable in the sense of Cauchy (Cauchy integrable) 
if there is 


R 
lim | f(z)dz. 


R- too sy 


We shall call this limit the principal value of the improper integral 
of f (x) in the sense of Cauchy and denote it by the symbol* 
~ y 
Vp. | {(z)dz= |hm | f(x) dz. 
90 Nowe -R 
Example 1. Find the principal value of the integral of sin 2. 
Since by the oddness of sin z 


R 00 
[ sinzdz=0, we have Y.p. | sinzdz=0. 
~R a 


The following statement is true. 

If f (x) is odd, then it is Cauchy integrable and the principal value 
of the integral of if is zero. 

I} f (x) is even, then it is Cauchy integrable if and only tf the improp- 
er integral 

| f(z) dz (3.17) 


0 
convcr ges. . 

The first part of the statement is obvious. To prove the second 
part it is sufficient to use the equation 


Rt R 
| f(r) dz =2 { f(x) dz, 
~R 0 


~ © Yup. are the initial letters of the French words Valeur prinetpal, “principal 
vahue”. 
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valid for any even function, and the definition of convergence of 
the improper integral (3.17). 

The concept of Cauchy integrability may be introduced for 
improper integrals of the second kind also when the singular point 
is an interior point of the interval over which we integrate. 

Definition. Let f (x) be defined on a closed interval la, b], except 
possibly a point c, a<(c <b, and integrable on any closed interval 
not containing c. We shall say that f (x) is Cauchy integrable if there: 
is a limit 


_ b 
lim ( fr pact | 1) dx) = V.p. J F@ dz 


called the principal value of the integral in the sense of Cauchy. 
Example 2. The function — is not integrable on {[a, 0}, 


a<c<ib in the improper sense. it is Cauchy integrable, how-- 
ever. We have 


b b 
V.p. | ae = lim ( \= — +) 
D a c+a 


3.4, MULTIPLE IMPROPER INTEGRALS 


This section extends the concept of multiple integral to the cases: 
of the unbounded domain of integration and of the unbounded inte- 
grand. Recall that it is these cases that were omitted by us from 
consideration when constructing the theory of multiple integrals. 

Note that we shall formulate the notion of improper multiple- 
integral so that it embraces both the case of the unbounded domain. 
of integration and the case of the unbounded function. 

3.4.1. Multiple improper integrals. Let D be an open set* of an 


m-dimensional Euclidean space E™. We shall use D to denote the- 
closure of D obtained by adjoining to D its boundary. We shall 
need the concept of a sequence {D,,} of open sets monotonically 
exhausting the set D. 


We shall say that {D,} monotonically exhausts D if: (4) for any n 


the set D, is contained in D,,44; (2) the union of all sets D,, coincides- 
with D**, 
Note that each set D, of {D,} is contained in D. 


* A set is said to be open if it consists of interior points only. An open set: 
is also called a domain. 
** A°union of all sets D, is a set | D containing all the points of each of the- 
sets D, and such that each point of D is in at Jeast one of the sets D,. 
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Suppose on D a function f(z), 7 = (Ty, Ta, oe ey Tm) IS Civen, 
Riemann integrable on any closed cubable subset of D. We shall 
‘consider all possible sequences {D,} of open sets monotonically 
exhansting D and having the property that the closure 2, af every 
set 2, is cubable (from which in particular it follows that each of 
the sets D, is hounded). 

If for any such sequence {Dy} there exists a limit of the number 


SEQUENCE 1 f (x) dx \ and that limit is independent of the choice of 


Dy 
sequence {D,}, then the limit ts said to be the improper integral af 
the function f (x) over D and is denoted by one of the following symbols: 


{ cee | P(Xt. Tay ees Tm) Ut, date... I2y_. (3.18) 


{ f(x) dz, 


pp Dd 


The improper integral (3.18) is called convergent. 

Note that the symbo} (3.48) is used also when the limits of se- 
‘quences D, do not exist. Then the integral (3.18) is called divergent. 

3.4.2. Improper integrals of nonnegative functions. We shall prove 
the following theorem. 

Theorem 3.6. For the improper integral (3.18) of a function f (x) 
nonnegative in D to converge it is necessary and sufficient that for at 
least one sequence of cubable domains {D,,} monotonically exhausting D 
the number sequence 


Wits (3.19) 


D 


Gy 


3 


Should be bounded. 

Proof. The necessity of the hypothesis of the thearem is obvious: 
the sequence (3.19) is nondecreasing (Y, is contained in D4. and 
f(z) SO), and therefore a necessary condition of its convergence is 
its boundedness. We proceed to prove the sufficiency of the hypo- 
theses of the theorem. Since (3.19) is bounded and is not decreasing, it 
converges to some number /. [t remains to prove that to the same 
number / converges the sequence 


Qy { f(rdr, 
Din 


where {Dj} is another arbitrary sequence of domains monotonically 
exhausting D. Take any integer 2 and consider a domnin D,. We 
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can find n, such that Dj, is contained in D,,*. Therefore 
Anan, <i. 


It follows that {a,} converges to some number J’ < /. Interchanging 
in our arguments the sequences a, and a, we arrive at the inequality 
I<J’. Consequently. 7’ = J. This completes the proof of the 
theorem. 

Example. Consider the integral 


\ e-s*-2 dx dy (3.20) 
D 


=| 


taken over an entire plane. Take the following system of concentric 


circles D, to be a system of domains {D,} monotonically exhausting 
D: 


rety<an, n=, 2... 


In each of such circles D, transform to a polar coordinate system r, 
q- We get 


22 mn 
a,= | | e-ato2" dz dy = [ | e-" rdrdg=a(i—e-™). 
‘Dr 0 0 
It follows that lim a, =a. By the theorem just proved the 


integral (3.20) converges and is equal to x. Note that the integral 
(5.20) may be represented as** 


= © 7 , 
l= \ e-~ az | e-V dy ={ edz). 
a soe os 


From this representation we obtain the value of the integral called 
a Poisson integral***: 


xs 


| e-“dr= Wan. 


a 
=—m 


* Suppose this is not the case. Then for any integer 4 we can find such a point 
AY, of the domain D;, which is not in a domain Dz. We can choose (by virtue of 
the closure and boundedness of Dj) a subsequence of the sequence {.1/,} con- 


verging to some point VW € D7. The point Af. together with same neighbourhood, 
helongs to one of the sets D,,. But then belonging to the same set D,, and to all 
the sets D, with large integers are points .; with arbitrary large integers. But 
this contradicts the choice of point 1,. 

** Tt is easy ta sce that such a representation is possible. if we take as the 
exhaustive system of domains a system of increasing squares with centres at 
the origin and with sides parallel to the axes and then apply the formula for 
iterated integration over each of such squares. 

=** 5.D. Poisson (1781-1840) is a French mathematician and physicist. 
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-7 (general compar; 
parison test). Let non) 1 
I (2) and g (z) Satisfy everywhere in D the condition ane Minetion 


I (z) < g (2). 


Then convergence of the improper integral j & (x) dz implies 4p 
e 


D 
convergence of the improper integral \ f (x) dz, 


D 
_ Proof. Let {Dn} be a sequence of domains monotonically exhaugi. 
Ing D. From the obvious inequalities 


a= | f(a)dz< | g(z)dz=o, 


s 
— 


Dn Dn 


it follows that the boundedness of the sequence 0b, implies that of 
the sequence a,. From this and from Theorem 3.6 it follows that 
the above theorem is true. 

It is common in investigating improper integrals for convergence 
to use standard comparison functions the most common of which is 


the function g(r) =. Jz [ip 0%, 
Example t. Let a> 0, D bea ball of radius a with centre at the 
origin, and g (x) = | 2 |-?. Take a System of concentric fibres D, 


formed by removing from D balls of radius 1/n with centre at the 
origin to be a sequence {D,,} of domains monotonically exhausting D. 
Introducing a spherical coordinate system (see Section 2.5.3°) we get 


Qn = \ § (z)dr= 


Dn 
a Z : m= 1 
= ) r-P+m—1 dr j de, dO, ... (I nail 6, | dOm-1- 


Denoting by w,, a positive number 


2n nt x m-! 
i= | 28 dO, ... ( Tl sin*-! 6, ) dOm—1, 


* It is assumed that |z| = Vxp-- 23+... + th. 
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we may write 
a 
Un=Om \ r7Ptm-ladr, 
ifn 


It follows that the sequence a, is bounded and hence converges 
if and only if p<(m. By Theorem 3.6 the improper integral of 
{z |? in D converges when p<(m and diverges when p > m. 

Example 2. Let a > 0, D be the exterior of a ball of radius a 
with centre at the origin, and g (z) = |zJ-°. Take a system of 
concentric fibres D, consisting of all the points z € &™ satisfying 
the condition 


a<x|{z|l<in 


to be a sequence {D,} of domains monotonically exhausting D. 
Using a spherical coordinate system we get 


An = \ g(x) dx = Om \ r-prm~1 dr 
Dn a 


From this and Theorem 3.6 it follows that the improper integral 
of ]|z |-? in D converges when p > m and diverges when p < m. 
3.4.3. Improper integrals of functions that do not maintain sign. 
Here we shall discuss the relation between convergence and absolute 
convergence of multiple improper integrals. As in the one-dimension- 


al case, the improper integral { { (z) dz is said to be absolutely 
D 


convergent if the integral \ | f (z) | de converges. We shall prove 
D 


that absolute convergence of an integral implies the usual conver- 
gence. The most remarkable is another property of multiple improper 
integrals that has no analogue in the one-dimensional case, which is 
that convergence of an improper multiple integral implies its abso- 
lute convergence. In other words, we shall prove that for improper 
multiple integrals the concepts of convergence and of absolute conver- 
gence are equivalent. 

Before turning to the proof of these properties we shall make some 
preliminary remarks. 

It follows from the definition of the improper integral that if the 
improper integral over D of each of the functions f, (x) and f_ (z) 
converges, then so do the integrals of the sum or the difference of 
these functions. 

Consider the following two nonnegative functions: 


fi (x) _ | f (x) Lord (z) ; f. (x) = | f(z) iat (x) . (3.24) 
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Obviously they may be defined by the relations 
if f(x)>0. 


, fF), 
i+ (7) =, if f(r) <0, 


(3.22 
. — f(z), if f(r} <0, 
}. (2) = { 0, if f(z) >0. 


Note also that the following obvious relations hold according to 
the definition of functions fs (7) and Ff. (x): 


0< fh @M<lf@bh G</-@M<lif@ |. (3.23) 
f (x) = f+ () — f- @). (3.24) 


We now proceed to prove the statements made earlier in this subsec- 
tion. 
Theorem 3.8. The absolute convergence of a multiple improper 


integral f (x) dr implies its usual convergence. 


D 

Proof, We turn to the functions fs (x) and 7_ (xz) just introduced. 
The integrability in the proper sense of f (z) over any cubable subdo- 
main of D implies the integrability over D of { f (x) { and therefore 
from formulas (3.21) too it follows that /4. (z) and f_ (x) are also 
integrable over any such subdomain. Using the convergence of 


[f (x) [ dz, the just indicated property of fy (7) and fu (x), in- 


D 
equalities (3.23) and Theorem 3.7, it is easy to see that the improper 


integrals | fa (x) dx and | j. (zx) dr converge. From this and fram 
p D 
relation (3.24) it follows that so does { (x) dx. The proof is com- 
D 
plete. 
Now we shall prove the converse. 


Theorem 3.9. Ifa multiple improper integral [ } (x) dx converges, 
D 
it does so absolutely. 

Proof. Suppose the statement of the theorem is false. From Theo- 
rem 3.6 then it follows that the sequence of integrals of {| f (x) | 
aver any sequence of domains {D,} monotonically exhausting D 
will be an infinitely large monotonically increasing sequence. IJt 
follows that {D,} may be chosen so that for any n = 4, 2, ... 


| Pf) lde>3 | 14 (2) | dz+2n. (3,25) 


Dra Ln 
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Denote by P, a set D,4, — D,. Then from (3.25) for any n we get 


f1f(@) [de>2 | 1 F(2) [det 2n, (3.26) 
z, Bn 

Since | f (x) | = f+ (&) + f. (z), we have 
(if (2) |dr= \ fa (2) dx { f(x) de. (3.27) 
P,, Py, P, 


Suppose of the two integrals on the right of (8.27) the largest is the 
first. Then from relations (8.26) and (3.27), for any n 


| fea) de> | 17 (@)laztn, (3.28). 


Pr Dy 


Divide the domain P, into a finite number of domains P,, so that 
the lower sum >} m;Ao; of fa (x) for this subdivision is different 
i 
from the integral over P,, of f+ (z) to so small an extent that chang- 
ing on the left of (3.28) the integral of that lower sum yields the 
following inequality: 
»y m,A0; => \ | f(z) | dun. (3.29): 
i Dn 
Since m; > 0, it is possible to leave in >} m;Ao; only the terms for 


which m; > 0. The union of the corresponding domains P,, , Will be 


denoted by Pe 
In a domain P, the function f (x) is positive and therefore f (x) = 
== f4 (x) in it. By (8.29) therefore we have 


\ f(@)ax> J | f (x) {dx-tn. (3.30) 
= Dn 


Pr 


/ 


note by Da the union of D, and P,,. Adding inequalit 3. 
to the obvious inequality ¥ 6.30) 


fie) dz>— J} If (2) | de, 


— 


Dn 
we then have 


[paaee™ (3.31) 


—*k 
Dn 
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‘Clearly the sequence of domains {95} monotonically exhausts the 
domain 2. But then, by (3.51), | ] (x) dr diverges. Since under the 


D 
hypothesis this integral converges, the assumption that the state- 
ment of the theorem is false does not hold. This completes the proof 
of the theorem. 

3.4.4. The principal value of multiple improper integrals. 

Definition. Let f (x) be defined for all x € E™ and integrable in 
every ball Kp of radius Rowith centre at the origin. We shall sau that 
f (x) is Cauchy integrable in £™ if there exists a limit 


lim | f(x) dz. 
Rew i 
“* RR ; 
This limit is called the principal value of the improper integral 
of f (x) in the sense of Cauchy and designated 


Vp. \ f(x) dx= lim | f(x) dz. 
R 


pm R-~-~ 5 


Example. Let f (x) have in spherical coordinates the form f (x) = 
= h (r) g (0,, Oo, . ~~, Om-y), Where the functions hk and g are 
continuous, with 


oi 1 % t= { 
| 20, | dO... | 8 (04, Ory 0, Bin ~1) ( II sin'-!0,) dOm—1=0. 
if 0 0 krof 


Then clearly f (z) is Cauchy integrable and 


Vip. | f(z)dz=0. 


“mn 
E 


In particular, for m = 2 the function of two variables f (z, y) = 
= h (r) cos pis Cauchy integrable and the integral of it in the sense 
of the principal value is zero. 

In the case where f (x) has a singularity at some point. z, of D 
the integral in the sense of Cauchy is defined as the limit 


Vop. | f(z) dz = lim | f(x) az, 
p Ro Dp 


where Dp is a set obtained by removing from D ao ball of radius & 
with centre al 2. 


_ CHAPTER 4 


LINE INTEGRALS 


In this chapter we extend the concept of one-dimensional definite 
integral taken over a line segment to the case where the domain of 
integration is a segment of some plane or space curve. 

Integrals of this kind are called line integrals. In applications it 
is customary to consider line integrals of two kinds (of expressions 
having a scalar and a vector meaning). In this chapter line integrals 
of the first and the second kind are discussed simultaneously. 


4.1. DEFINITION OF LINE INTEGRALS AND THEIR PHYSICAL 
INTERPRETATION 


Consider in the Ozy plane some rectifiable curve J having neither 
points of self-intersection nor overlappings. Suppose that the curve 
is defined by the parametric equations 


| x= (t), 


y= (t) (axt<b) (4.4) 


and first consider it to be unclosed, with end points A and B. 
Suppose further that 


the function f (z, y) | the two functions P(z, y) and Q(z, y) 


are defined and continuous along the curve L= AB*. 

Divide the closed interval a<t<b using the points a = 7%, 
<t<t<...<t, = bd into n subintervals [t,_,, t,] (k = 
= 41,2, ..., n). 


* The function f (x, y) is said to be continuous along L if given any e > 0 
we can find 6 > 0 such that | f (z1, v1) — f (za, yo) | < & for any two points 
(x1, y;) and (ze, ye) of L satisfying the condition V (@ — 20)" + (yy) — yo)? < 5. 
In fact we have defined not the continuity but the uniform continuity of 7 (z, y) 
along L, but since the set of all points of Z is bounded and closed these notions 
coincide. 


8—01684 
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Since on L a certain point Af, (2), uy.) with coordinates x, = 
=  (t:), uv, = tp (t;,) corresponds to each value of t), this subdivi- 
sion of alt <b dividesthecurve L = AB into n subarcs M54), 
MyAla, . 6) My al, (Fig. 4.4). 

Choose on every subare Af, _,57, an arbitrary point Vy (&, vs) 
whose coordinates €,, 1) correspond to some value t, of the para- 


y | 


Fig. 4.4 


meter ¢f, so that ¢, = p(t), MW, = WP (tr), with th.» St <b). 
Let us agree to denote by Aj, the length of the Ath subare J/;,_.M), 
(ik = 4, 2, ..., n). 


Form the integral sum Form the two integral sums 
Nn n 
‘2! rs , \* + 
Oo, = (Ens Mn) Ad,.. (4.2) = P (Ex, Nn) (Xp, —~— Xp~4)s 
(4.2°) 


nN 
o3= = Q (En. Wd (Yn — My aa) 
pecs 
(4.2”) 


The number J is said to be the limit of the integral sum o, (s = 1, 2, 3) 
as the largest of the Jengths Ad, tends to zero, if given any e > 0 
we can find 6 > 0 such that |}o, ~— J |<e as soon as the largest of 
the lengths Ad, is less than 6. 


Definitions 


If there exists a limit of the If there exists a limit of the 
integral sum o, as the largest of | integral sum o. los] as the largest 
fhe lengths Al, tends to zero, then | of the lengths Al, tends to zero, 
this limit is said to be a line | then this limit ts said to be a line 
integral of the first kind of | integra] of the second kind and 
{(z,y) along DL and designated | designated 


| fe yal { Piz, wazl | @(e may]. 
L 


ADB AB 
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or 


\ f(z, y)dl. 


AB 


(4.3) 
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It is customary to call the sum 


P(x, y)dz+ \ Q (x, y) dy 
AB AB 


the general line integral of the 
second kind and to designate it 
by the symbol 


| P(x, y)de+Q(z,y)dy. (4.3") 
AB 


Now we shall give a physical interpretation of the line integrals 


we have introduced. 


Suppose a mass with linear 
density f(z, y) is distributed 
along the curve L. To compute 
the mass of the entire curve, it is 
natural to divide the curve into 
small segments and, assuming 
the density to vary little on each 
segment, to set the mass of each 
segment approximately equal to 
the product of some intermediate 
value of density by the length 
of that segment. 

In such a case the mass of the 
entire curve will be approximate- 
ly equal to the integral sum 
(4.2). The exact value of the 
mass is naturally defined to be 
the limit of the sum (4.2) as 
‘the length of the largest segment 
tends to zero. 

Thus the line integral of the 
first kind (4.8) gives the mass of 
a curve the linear density along 
which is equal to f (x, y). 


Suppose a_ particle moves 
along the curve L from A to 


B under the force F (x, y) with 
components P (z, y) and @ (a, y). 
To compute the work done by the 
force to move the particle, it is 
natural to divide the curve Z 
into small segments and, assum- 
ing the force to'vary little on each 
sepment, to set the work on each 
segment approximately equal to 
the sum of the products of the 
force components taken at some 
intermediate points by the dis- 
placement vector components. 
In such a Case all the work done 
by the force on the particle to 
move it from A to 8 will be 
approximately equal to the sum 
of (4.2’) and (4.2”). The exact 
value of the work is naturally 
defined to be the limit of that 
sum as the length of the largest 
segment tends to zero. 

Thus the general line integral 
of the second kind (4.3’) gives 
the work done by a force, with 
components P (x, y) and Q (z, y), 
on a particle to move it along the 


J curve L from A to B. 


Remark 1. It is obvious from the form of the sums (4.2), (4.2’) 
and (4.2") that the line integral of the first kind is independent 


g* 
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of the direction in which (from «<{ to 2B or from B to al) the curve 
L is travelled while reversing the sense of the curve results in revers- 
inv the sien, i.e. 


| P(z, ydr= — | P(x, yj) dz, O(n. uydy = — | Q(z, u)dy. 


AR Ba RY: HA 


Remark 2. Quite similarly introduced for the space curve are 
the line integral of the first kind f(x, y, 2) dl and the three line 


a 


AB 
intevrals of the second kind 


| P(x,y,2)de, | Q(x, y, 5) dy, R(x, y, =) dz. 
’ 1B 


AS | 


Po 
ee 


It is customary to call the sum of the last three intevrals the general 
dine integral of the second kind and to designate it by the symhol 


| Piz, y, s)dx+-Q (2, y, s)dy+ Rr, yu, 2) dz. 


AB 


4.2. EXISTENCE OF LINE INTEGRALS AND REDUCING THEM 
JO DEFINITE INTEGRALS 


Let us agree to say that the curve Z is smooth if the functions ¢ (2) 
and tp (/) of the parametric equations (4.1) defining it have on fa, bI 
continuous derivatives ’ (t) and wy’ (0*. 

The curve / will be said to be piecewise smooth if it is continuous 
and splits into a finite number of pieces with no interior points in 
common, each piece being a smooth curve. 

According to what was agreed upon as far back as Chapters 15 
and {6 of [4], the points corresponding to the value of the parameter ¢ 
for which the derivatives m’ (7?) and yp’ (t) both vanish will be called 
singular points of the curve ZL, 

We shall prove that if the curve L = AB is smooth and does not 
contain singular points and if the functions f (x, y), P (xz, y) and 
0 (x, u) are continuous along the curve, then the following formulas 


* By this we mean that the derivatives q‘ (1) and yf’ (#) are continuous at any 
interior point of [a, b] and have finite limiting valuce at the point @ fram the 
right and at 6 fram the left. 
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reducing line integrals to usual definite integrals are valid 


\ it, y)dl= | Pie, y) dz= 
AB AB 


vb 6) 
= | fle (), v@I x = | Pig), pe’ at, (4.4) 


a 


x Vie OPI WP at. (44) | ( Q(2, yay= 
AB 


b 
= | Qfo(e), v(t) W (dt. (4.4") 


At the same time we shall prove the existence of all line integrals occur- 
ring in these formulas, 

We notice first that the definite integrals on the right of (4.4), 
(4.4'), and (4.4”) a fortiori exist (for under our hypotheses the inte- 
grands in each of these integrals are continuous ona <t < J). 

For the line integral of the second kind we shall derive only formu- 
la (4.4’) (for derivation of (4.4”) is quite similar). 

As in Section 4.4 we divide the interval a<c t <b by means of 
the points a=t)<t,<ip<...<t, =D into nm subintervals 
and form the integra] sums (4.2) and (4.2’). 

Now we take into account the fact that 


Al, = Ly — Zp-1 = P (tn) — P (tnt) = 
tp te 

= { Viv @F+9 Oa. | =} o Mat. 
they they 


This allows us to rewrite the expressions for (4.2) and (4.2') as follows 


O1= S{F(@ (te), VTA) X — | O2= D) {PL (ta), (tal X 
k=1 k=1 


th, th 
x | Vie Orr Mra}. | x | oat}. (4.5") 
Th} ha2 


(4.5) 


(We have also taken into account the fact that &, = @ (Tx), Nn = 
= tp (t,), where t) is some value of ¢ satisfying the condition th. S 
ST Sth.) 
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Now denote the definite integrals on the right-hand sides of of formu- 
Jas (4.4) and (4.4) by A, and A., respectively. Dividinga <t<b 
into a sum of n subintervals [t, 1, t,] we may write the “definite 
integrals A, and A. as follows: 


n fy, Tl 4, 
R= liters | k= yd | Pir), eos 
ke} tymt =} toy 
2 Vie (OPS Y? OF dt. x @ (dt. 
Consider and evaluate the differences 
—A,= O.—Ko= 
n , n 
=>) | (fly (m2). T(t =) | {P [4 (ty), Y (Ta) — 
he} thoy A=! Moy 
—fip(t), P(A} x —~Pip(t), F())} P(t) dt. (4.6°) 


KY fe’ (ES Ee Fe dt. (4.6) 


Since z = ¢ (t) and y = tp (¢) are continuous on a<t <b and 
f(z, y) and P (x, y) are continuous along J, by the theorem on the 
continuity of a composite function (seo Section 14.5. of MD 
fl @), p(t)] and P lp (t), pO) are continuous on ax<t<b. 

Now notice that as the largest of the lencths of the subarcs AL, 
tends to zero, so does the largest of the differonces (t;— ¢,_,)*. 
But from this it follows that given any e >O we can find 6 > 0 
such that provided the largest of the lengths Al, is less than 4, each 
of the braces in formulas (4.6) and (4.6) is less than e. Therefore, 
provided the largest of the jousths Al, is less than 6 we obtain for 


ty 
* Indeed Al, = Vie WEY (fF dt. Since qe (ty and (1) are 


‘. 
hey . 
continuous on asxts<b and do not vanish together, the function 
Vig wp ie" CE is continuous and strictly positive — on agicgh. 
Therefore the minimum value m of the last function on a<t<<h is posi- 
ty ; 
tive. But then Al; > m | dows en (ty — tp ay), bee by thi — Abn. 


e 


ty -( 
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the differences (4.6) and (4.6’) the following evaluations: 


nth | 0,.—Kel< 
Jo, —A, [Se > \ x n tr 
Rat thy <e¥ | lv’ @la< 
x Vie’ OP +Iy’ MP dt= iad 
b n tp 
=e | Vig OF TW Wr at= | <eM >) | at=eMt (b—a), 
a k= mt tt, 
= el, 
where 7 is the length of the | where 7 is the maximum value 
curve AB, of |p’ ()] on a=ct=b. We 


stress that in deriving (4. 4’) we 
require only that g’ (é) should 
be continuous and that the curve 
I= AB8 should be rectifiable 
(it is not required that »’ (é) 
should be continuous). 


In view of the arbitrariness of © we may assert that the integral 
sums 6, and of have (as the largest of the lengths AJ, tends to zero) 
limits equal to K, and K, respectively. This simultaneously proves 
the existence of line integrals on the left-hand sides of (4.4) and (4. 4’) 
and the validity of these formulas. 

Remark 1. In the case of the piecewise smooth curve L it is 
natural to define line integrals along the curve as the sums of the 
corresponding line integrals over al] smooth pieces forming the 
curve LZ. Equations (4.4), (4.4') and (4.4") thus turn out to be true 
for the piecewise smooth curve Z too. They are also true in the case 
where f (z, y), P (z, y) and Q (z, y) are not strictly continuous but 
merely piecewise continuous along JZ (i.e. where Z splits into a finite 
number of pieces, having no interior points in common, along each 
of which the functions are continuous). 

Remark 2. Quite similar results and formulas are true also for 
line integrals taken over the space curve £ = AB defined by the 
parametric equations 


r= it), 
y= Y(t), (@St<b). 
= 4 (t), 
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We confine ourselves to writing the formulas 


| f(z, y,c)dl= [ P(r, 4; ajdz= 
AB AB 
a b 
= fie, '(). 24] x = [ Ppt), TO), 24 dt, 
fo Ww Ore ie (OV | Q(z. y, 2) dy= 
x | + [1" (2)? +17" (f))?dt. | Ap 


b 
~ | QO tp (2), 1). ZOD (d) ae, 


| R(x, y,s)ds= 
AB 
¥ 
= \ Rip(t), F(t), ZC) 2! (t) de. 
Qa 
Remark 3. We have established above that the Jine integral of 
the second kind depends on the sense of rotation of the curve J = 
= AB. We should therefore specify 
y what we mean by the symbol 
| P(x, y)dz+Qlz, y)dy (4.7) 
L L 
when £ is a closed curve (i.e. when 
the point B coincides with A). 
Of the two possible directions of 
circulation about a closed contour L 
the direction in which the domain 
inside the contour remains to the 


a| = left of the point tracing around the 
wo, path is called positive*. In Fig. 4.2 
Pig. 4.2 the positive direction is repre- 


sented by arrows. 

We shall assume that in the integral (4.7) along the closed contour L 
this contour is always traced in the positive direction. 

Remark 4. It is easy to show that line integrals have the same 
properly that the usual definite integrals have (the proofs are similar 
to thoso presented in Sections 10.5 and 10.6 of [1]). Under more 
stringent hypotheses, however, these properties follow at once from 
formulas (4.4), (4.4) and (4.4”). 


* By convention such a direction of mation may be ca}led a “counterclockwise 
motion”. 
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We list them as applied to line integrals of the first kind. 

4°. Linear property. If for either of the functions f (2, y) and. 
g (z, y) there is a line integral along the curve AB and if a and f 
are any constants, then for the function [af (zx, y) + Bg (z, y)T 
there is also a line integral along AB, with 


| laf (x, y) +e (a, ydl=a | f(a, y)d+B | e(z, yal. 
AB AB AB 


2°. Additivity. If the arc AB is made up of two arcs AC and CB 
and if for f (z, y) there is a line integra] along AB, then for f (z, y) 
there is a line integral along either of the arcs AC and CB. with: 


J fe, na= Vf, walt | fle na. 
AB AC CB 


3°. The computation of the absolute value of the integral. If there 
is a line integral along AB of f (x, y), then there is a line integral 
along AB of |f (x, y) |, with 


| f(z, yabl< | | f(w,y) fa 
AB AB 


4°, Mean value formula. If f (z, y) is continuous along the curve 
AB, then there is a point M* on that curve such that 


| fle y)db= 7 (MI), 
AB 


where 1 is the length of AB. 
Example 1°. Compute the mass of the ellipse £ defined by the 


parametric equations 

g=acost, 

(O<t<2n) 
ys=Osint 


provided a > b > 0 and the linear density of mass distribution is 


= | Y }- 
The problem reduces to computing the line integra] of the first 
kind \ | y | di. 


Using formula (4.4) we get 
27 

\ jy{dli=6 \ {sint | Va? sin? t-+ 6? cos*t dt= 
0 

L 


at 
=b \ sin t V a? sin?t-+ 5? cos?t gt — 
0 
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ae 
—} \ sint ‘asin? t +b cos*id! = 
) et 
= —b \/ a? — (a? — b*) cos? t d (cos t) + 
0 


4-b | Vat (a8) costed (cost) = 26 (b-+a MSS), 

" 
{ste 
where? e= Hom 


a 
Example 2°. Compute the line integral of the second kind 


T= \ (2? —22y) dr+(y?—2zy) dy 
1. 
where Z is the parabola y = 2°, with —1 <2<1. This parabola 
may be regarded as a curve given by the parametric equations 


z=f, 
| i (CH 1<t< 0). 
= t- 


‘Using formulas (4.4') and (4.4") therefore we have 


| | 
P= | (e205) de4+ | (8-20) 2rae= ~ 4B, 
<3 -{ 


* Recall that the number e is termed eccentricity in analytic geometry. 


CHAPTER 5 


SURFACE INTEGRALS 


In this chapter we shall consider integration of functions given 
on surfaces. In this connection we first investigate the concept of 
surface and the concept of surface area. 


5.14. THE SURFACE 


5.1.1. The surface. The mapping f of a domain* G in the plane 
onto the set C* of three-dimensiona] Euclidean space is said to be 
diomeomorphic if f is a one-to-one correspondence between the points 
of G and G* under which corresponding to any convergent sequence 
{T,,} of points of G is a convergent sequence {i>} of points of G* 
and corresponding to every convergent sequence {1/5} of points of G* 
is a convergent sequence of points {7,} of G. In other words, a ho- 
meomorphic mapping of the domain G onto the set G* is a one-to- 
one conlinuous mapping of these sets. We shall say that G* is the 
image of G under homeomorphic mapping f. 

Consider the following example. Let G be a domain in the Oxy 
plane. (wu, v) the coordinates of a point AY of that domain, z = 
= 72 (1) a function continuous in G, and G* the graph of that func- 
tion. Clearly the mapping f of G onto G* given by the relations 


=u, yY =U. ts =2(u, v) 


is a homeomorphic mapping of that domain onto G*. 

We introduce the concept of elementary surface. 

A set @ of points of a three-dimensional space is said to be an ele- 
mentary surface if it is the image of an open disk G under homeomorphic 
mapping of G into the space**. 

Using the notion of elementary surface we introduce the concept 
of the so-called simple surface. 

Before proceeding we introduce the notion of neighbourhood of 
a point of a set © of an Enclidean space £°. 


* Recall that a domain is a set each point of which is an interior point. 
** We are considering three-dimensional Euclidean space, although we could 
consider Euclidean space of any number of dimensions and speak of two-dimen- 
sional surface in that space. 
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A neighbourhood of a paint V/ of (Dis the common part of @ ane 
the spatial neighbourhood of the point J. 

A set of points of a space is said to be a simple surface ij that sz 
is connected* and any point of the set has a neighbourhood that is an 
elementary surface. 

Note that an elementary surface is a simple surface but that ¢ 
simple surface is not an elementary surface in general. For instance, 
the sphere is a simple hut nonclemen- 
tary surface. 

We formulate the concept of pen- 
eral surface. 

The mapping f of a simple surface G 
is said to he locally houmecomorphic if 
each point of G has a neighbourhood 
that is homeomorphically mapped onto 

Fig. 5.4 its image. 

A set ) of points of a space is snid 
to be a general surface if it ts the image of a simple surface under 
its locally homeomorphic mapping into the space. 

temark 1, Note that neighbourhoods of points on a general surface 
are introduced as the images of neighbourhoods of puints of that. 
siraple surface whose imape is a given general surface. 

Remark 2. A simple surface is clearly a surface without self- 
intersections and without overlappings. A genera] surface may have 
self-intersections and overlappings. For instance, the surface in 
Fig. 5.1 has self-intersections but is the Jocally homecmarphic image 
of a zone of a cylinder and is therefore a general surface. 

5.1.2. The regular surface. We introduce the concept of regufar 
(-times differentiable) surface. 

Al surface D whose points have coordinates x, y, = is said to be regu- 
lar (/:-times differentiable) ij for some ik > 1 each point of D has 
a neighbourhood allowing a k-time differentiable parametrization. 
This means that each of the above neighbourhoads is a hemeemor- 
phic mapping of some elementary domain G** into the plane (u. v) 
by means of the relations 


r=zr(u, vv) yurytuv. = 2 (u, v) (0.4) 


in which the functions 2x (u. cv), y (u, vj), 2 (u,v) are f-times dif- 
ferentiable in G. 
If & = 1, then the surface is usually said to he smooth, 


* Reeall that a seetissaidto be connected ff any two of its points can be 
Joined by a continucus curve consisting entirely of points of thet ect. 
e*« A domain 6 in the plane is said to be elenentary if it is the iinage of on oper 
dizk under homectworphie mapping fF that disk onto the plane, 
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We shall also say that by means of relations (5.1) we introduce 
in a neighbourhood of a point on the surface a regular parametrization 
using the parameters u and v. 

Remark {. If the whole of @ is given by the mapping (5.1) of G, 
then we shall say that a single parametrization is introduced on ®. 

A point of a regular surface is said to be ordinary if there is a regu- 
lar parametrization of some neighbourhood of it such that at that 
point the rank of the matrix 


a—(* Yu *) (5.2) 


Ly Yy Sy 


is two. Otherwise the point of the surface is called a singular point. 

A domain G in the plane is said to be simple if that domain is 
a simple plane surface. For instance, the open ring is a simple do- 
main. 

We shall say that the function f (u, v) belongs to the class C* 
in Gif it is k-times differentiable and if all its partial derivatives 
of order & are continuous in G. 

The following theorem is true. 

Theorem 5.1. Let G be a simple domain in the plane (u, v) and let 
z(u, v), y (u, v), z (u, v) be functions of class C4, k > 1, given in G, 
with the rank of the matrix (5.2) equalling two at all points of G. Then 
relations (5.1) define in space a set D which is a regular k-times differen- 
tiable general surface without singular poiits. 

Proof. Obviously it suffices to show that using relations (5.1) 
gives a locally homeomorphic mapping of G onto @® 

Let AM (xo, Yo. Zo) be any fixed point of D corresponding to the 
values (Wp, Vp) of the parameters (u, v) (Fig. 5.2). Under the 
hypothesis the rank of the matrix A is two at a point (ug, Vo). 


Cy Y 


Suppose for definiteness the determinant | — ~"“ | of the matrix 


b dod 

A is nonzero at that point. Since the determinant is a Jacobian 
D (x, y) 
D (u, v) 
have continuous partial derivatives in G, by the theorem on the 
solvability of a system of functional equations (see Theorem 15.2 
in {l]) there is a neighbourhood A of the point (x, yo) in the 
Oxy plane such that in that neighbourhood there is a unique and 
k-times differentiable solution 


and is nonzero at the point (Wp, Uo) and z(u,v), y (u, Vv) 


u=u(z, y), vD=v(z, y) (5.3) 
of the system 

xz(u, v)—xz~=0, 

y(u, v)—y=0. 
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Fig. 5.2 


It follows from the above reasoning that some neighbourhood H 
of the point (zo, ¥,) is a homeomorphic mapping of some neighbour- 
hood G of the point (9, U9) With the aid of the relations z = z (u, v), 
y = y(u, v) (the inverse mapping of H onto G is effected using 
relations (5.3)). 

Substituting expressions (5.3) for u and v in the relation > = 
= 2(u, v) we see that some neighbourhood @ of a point AY, on @ 
is the graph of the k-times differentiable function 2 = 2 (u (x, y), 
vu(z, y)) = <(z, y). But this means that using the function < (7, y) 
cfiects a homeomorphic mapping of a neighbourhood JJ of a point 
(Zo, Yo) in the Ozy plane onto the neighbourhood @ of the point Aly 
of @. It is obvious that the neighbourhood G of the point (u,, Uo) 
is homeomorphically mapped onto the neighbourhood @ of AJ, on 
@*. In other words @ is the image of G under locally homcomor- 
phic mapping into space and is therefore a genera) surface. This 
completes the proof. 

Remark 2. In the process of proving the theorem we have estah- 
lished that cach point Af, of a surface @ without singular points has 
a neighbourhood @ which is 1-1 projected onto one of the coordinate 
planes and is therefore the graph of a k-times differentiable function 
(this was the function 2 (z, y) in the proof of the theorem). 


* Here we have used the obvious statement that a composition of hamcs- 
morphic mappings results in a homeomarphic mapping. 
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Line uv 


Fig. 5e3 


Figure 5.2 shows points AJ, and Ny whose neighbourhoods are 41-4 
projected onto the Oxy and Oxz planes respectively. 

5.1.3. Defining a surface by means of vector functions. Consider 
a regular surface @. It is a certain set of points AJ with coordinates 
(2, y, 2) (Fig. 5.3). Denote by r (M) the vector going from the origin 
to a point JY on the surface. Clearly r (M) is the vector function 
of a variable point Af of the surface*. It is usually called the radius 
vector of the surface @. 

Let us turn to that neighbourhood of the point AZ which is the 
image of a homeomorphic mapping (5.1) of some elementary do- 
main G** (in Fig. 5.3 the neighbourhood is encircled by a broken Jine). 
Then clearly the coordinates x (u, v), y (u, Vv), 2 (u, Vv) of the point AJ 
are the coordinates of the vector r (MM). Itis obvious that in this neigh- 
bourhood the function 7 (47) is the function of the variables u and v: 
r(M) =r (u, v). With the value of v fixed, the terminal point of 
the’ radius vector r (u, v) traces in the neighbourhood under consid- 
eration a curve called acurve u (oracurve v = const). With the value 
of u fixed, the termina] point of r (u, v) traces a curve v (or u = 
= const). These curves u and v are called coordinate curves on the 
surface @ in the neighbourhood under consideration. 

A system of coordinate curves u and v may thus be introduced in 
some neighbourhood of each point of the surface @. It is sometimes 


* A vector function may be regarded as a collection of three scalar functions. 
Detailed information on vector functions is given in Section 12.1. We shalf 
use it as need arises. 

** A domain G in the plane is said to be elementary if it is the homeomorphic 
image of an open disk. 
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called a system of curvilinear coordinates on the surface (more precisely, 
in the neighbourhood under consideration). 

Jn Section £2.1 a geometrical interpretation of the derivatives r,, 
and r, of the vector function r (a, v) is given. These vectors are 
the tangent vectors of coordinate curves (see Fig. 5.3). 

Using the vectors r, aud r, one can see the geometrical meaning 
of the ordinary and the singular point of the regular surface. 

Recall that a point VW of a surface is said to he ordinary tf in 
a neighbourhood of the point we can introduce a parametrization 


Singular point Line of singular potnts 


MEP 


Pip. 5.4 


with the aid of equations (5.1) such that the rank of the matrix A 
{see relation (5.2)) at the point is 2. Since the rows of A consist of 
the coordinates of r, and r, and tho rank of A is two, ry, and r, 
are linearly indopendent. So the ordinary point is characterized by 
the possibility of introducing in a noighbourhood of the point a 
parametrization such that ry, and r, are linearly indopendent at 
(he point J. 

In Fig. 5.3 Jf is an ordinary point of the surface @., Figure 5.4 
represents the surfaces with singular points. 

5.1.4. The tangential plane and the normal to a surface. One-sided 
and two-sided surfaces. We have already introduced the concept of 
tangential plane to a surface which is the graph of the differentiable 
function < = 3 (x, y) (see Section 14.4.2 of 1h). Recall that the tan- 
gential plane at a point Jf, was defined to be a plane having the 
property that the angle betweon that plane and the secant WWol 
(AF being an arbitrary point of the surface) tends to zero as JF tends 
to VW. We have proved that if s (z, y) is a function differentiable 
at a point (29, Yo), then there is a tangential plane at the point 
Moy (Tp, lov # (To, Yo)). 

We shall show that there is a tangential plane at any ordinary 
point af a smooth surface. To do this it is clearly enough to ostablish 
that some neighbourhood of the ordinary point of the surface is 
the graph of the differentiahle function. But in Section 5.4.2 (see 
Remark there) we proved this property for any ordinary point of 
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a smooth surface. Consequently, there is a tangential plane at any 
ordinary point of a smooth surface. 

Remark 1. It follows from the definition of the tangential plane 
to a surface M that the tangent line at a point M, to any smooth 
curve* situated on the surface and passing through M, lies in the 
tangential plane to D at M,. Since yr, and r, are tangential to the 
curves wu and v passing through M,, they are in the tangential plane 
at M,. 

We introduce the concept of normal to a surface @ at a point Mo. 

The normal to a surface DM at a point M, is a straight line passing 
through AZ, and perpendicular to the tangential plane at M/,. The 
normal vector to the surface at MM, is any nonzero vector collinear 
with the normal at M5. 

Let M, be an ordinary point of a smooth surface M and suppose 
that some neighbourhood ® of that point is defined using a vector 
function r (u, v) such that the vectors r, and r, are not collinear 
at (7). Then clearly the vector 


N = {rur,] (5.4) 
is the normal vector to @ and the vector 
__ [Pury] 
= }{rurn] | (9.9) 


is the unit normal vector to ®. 

Remark 2. Since under the hypothesis the surface is smooth, 
the vector function WV (u, v) and the vector function nv (u, v) defined 
by relations (5.4) and (5.5) respectively are continuous. There is 
thus a continuous normal vector field at some neighbourhood of each 
point of a smooth surface. 

The question naturally arises: is there global continuous vector 
field of normals on any smooth surface? It turns out that there are 
surfaces on which there are no global continuous vector fields of 
normals. An example of such a surface is the so-called Mobius strip** 
depicted in Fig. 5.5. (This surface is obtained from the rectangle 
ABB'A' by pasting together the sides AB and A’B’ in such a way 
that the points A and 8’ and the points A’ and B coincide, see 
Fig. 5.5.) 

A surface on which there is a global continuous vector field of 
normals is called two-sided. A surface on which there is no such global 
field is called one-sided. 

The plane, sphere, ellipsoid, one-sheeted hyperboloid are two-sided 
surfaces, the Mobius strip being a one-sided surface. 


* A curve Z is said to be smooth if it can be given using the vector function 
r (t) of class C+ for which r’ (t) s& 0 (for more detail see Section 12.2). 
** A, Mobius (1790-1868) is a German mathematician. 


9—01684 
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We shall consider only two-sided surfaces in what follows. 

9.4.5. Auxiliary lemmas. Here we prove some statements we 
sha}] need in further discussion. 

Lemma 1. Let Af, be an ordinary point of a smooth surface @, 
Then some neighbourhood of My, is 1-1 projected onto the tangential 
plane drau-n at any point of the neighbourhood. 


Proof. We show that for instance that neighbourhood ® of Jy, 
possesses the property pointed out in the Jemma in which the normal 
at any point makes with the normal at AJ, an angle less than a/4 


Fig. 5.6 


and which is 1-1 projected onto some 
disk in one of the coordinate planes 
(for instance, inthe Ory plane)*. Note 
first that the normals at any two points 
of (@ make an angle Jess than 2/2. Fur- 
ther let @ lack the above property. 
Then for some point 17 of @ we can 
find points P and Q of @such that the 
chord PQ js paralle) to the normal rs, 
at J (Fig. 5.6). Consider the line of 
intersection of with a plane paral- 
lel to the Oz axis and passing through 
PQ. By the choice of the neighbourhood 
(PD the portion PNQ of the line lies in @ 


andis the graph of the differentiable function given on the segment 
which is the projection of PQ onto the Ory plane. By the Lagrange 
theorem the tangent at some point NV of PNQ is parallel to the chord 


*" The possibility of choosing such a neighbourhood @ stems from the 
following considerations. In Section 5.4.4 we noted (see Remark: 2) that 
there is a continuous vector field of normals at some neighbourhood of an 
ordinary point on a surface. In a sufficiently small neighbourhood of Af, 
therefore the normals make with the normal at Af, an angle less than a/4. 
We have also established that some neighbourhood of af, ts 1-1 projected 
onte a coordinate plane. In that neichbourhood there is clearly a part that 
is projected onto some disk in the coordinate plane. 
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PQ and therefore to the normal ny, at J7. But then the normal at 
N perpendicular to that tangent makes an angle of n/2 with the 
normal at JAZ. But this is not possible since the normals at any two 


points of M (including Jf and NV) make an angle less than n/2. The 
contradiction obtained shows that the lemma is valid, which com- 
pletes the proof of the lemma. 

We introduce the concept of complete surface. A surface M is said 
to be complete if any fundamental sequence of points of © converges 
to some point of the surface. 

The plane, sphere, ellipsoid, one-sheeted hyperboloid are examples 
of complete surfaces. The open disk, any open connected set on a 
sphere are incomplete surfaces. Bounded complete surfaces and bound- 
ed closed parts of complete surfaces will be called bounded complete 
surfaces in what follows. 

We shall say that the part of M has a size less than 6 if it can be 
placed in some sphere whose diameter is less than 6. 

The following lemma holds. 

Lemma 2. Let ® be a smooth bounded complete surface without 
singular points. There is 6 => 0 such that any part of D whose size 
is less than § is 1-1 projected onto the tangential plane passing through 
any point of that part. 

Proof. Suppose the statement of the lemma is false. Then for 
any 6, = 1/n, n = 1, 2,..., there is a part D, of M whose size is 
less than 6, and which is not 1-4 projected onto the tangential plane 
at some point of it. Choose in every part @, a point M],, and select 
a subsequence of {//,,} converging to some point 7, of D*. Consider 
a neighbourhood of M, satisfying the conditions of Lemma 1. 
With n sufficiently large, the neighbourhood will contain all parts 
@,. But then every part should be 1-4 projected onto the tangential 
plane (at any of its points) but this contradicts the choice of parts @,. 
The proof of the lemma is complete. 

The following lemma is true. 

Lemma 3. Let © be a smooth bounded complete surface without 
singular points. There is 6 > 0 such that any part of © whose size is 
less than 6 is 1-4 projected onto one of the coordinate planes. 

The proof of the Jemma is closely analogous to that of Lemma 2. 

Lemma 4. Let D be a smooth bounded complete two-sided surface 
without singular points. Then given any ¢ > 0 we can find 6 > 0 such 
that for the cosine of the angle y between the unit normal vectors at any 


two points of an arbitrary part ® of the surface whose size is less than 5 
we have the representation 


cos y = 1 — Go, (5.6) 
where | aq | <&. 


* Since ® is a bounded complete surface, such a subsequence can be chosen. 


9% 


{32 Fundamentals of Mathemetical Analysts 


Proof. Consider a vector field of unit normals rn (3/) continuous 
on (such a field exists, since Gis a two-sided surface). The vector 
function ris uniformly continuous since @ is a bounded complete 
surface and hence a bounded closed sot. Given any e > 0 therefore 
we can find 6 > 0 such that for two arbitrary points 3/, and 1, of 
W) the distance between which is less than 6 we have 


ln (M.) —n(ah) |< V2. (5.7) 
Since 


cog = 1S (n (Ma)—n (MT), 


putting 
Ctay (Mg) = (MY? 


and using inequality (5.7) we see that relations (5.6) are true. This 
completes the proof of the lemma. 


5.2, SURFACE AREA 


5.2.4. Surface area. Let © be a bounded complete two-sided 
surface. Divide @ with piecewise smooth curves into a finite number 
of parts , each 1-1 projected onto the tangential plane passing 
through any point of that part**. Denote by A the maximum size 
of the parts (D, and hy o; the area of the projection of @, onto the 
tangential plane at some point JJ; of the part @,. Further form the 


sum >, 0, of all the areas. 


We state the following deftnitions. 
Definition ft. The number o is said to be the limit of the sum Doh 


{ 
as A —Oif given anye >Owecan find § > 0 such that for all subdi- 
visions of MD with ptecewise smooth curves into a finite number of parts 
qp, for which A < 6 regardless of the choice of points M,on the parts QD, 


|Nop—ol<e. (5.8) 
i 


ans 
* We have used the following relations: 


pee (Af) = 4, n? (Af.) == 4, R (Af4) n (Jf) == C08 VY, 
A (n (Ma)—7 Onyyr=— (n? (Sf,)—2n (Ala) n (My) +n? (M,)). 


** The possibility of such a subdivision is guaranteed by Lemma 2 of Sce- 
= i 
tjo3) 5.1.0. 
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Definition 2. If for a surface @ there is a limit o of the sums > or 


i 
as A --0, then the surface is said to be squarable and the number o 
is said to be the surface area. 

Our immediate task is to find sufficient conditions for the squara- 
bility of a surface. We shall prove that smooth bounded complete 
two-sided surfaces are squarable., We 
shall simultaneously show the computa- 
tional techniques used to compute surface 
areas. 


At first sight it would be natural to approach 
the question of computing the area of a surface 
using approximation of the surface with poly- 
hedra. This way is ineffectual. however. We 
shall show an example due to Schwarz* which 
demonstrates that the areas of polyhedra in- 
scribed into a smooth surface may increase un- 
boundedly as the number of faces increases and 
their size decreases. 

Let ® be a zone of a cylinder (Fig. 5.7). 
Divide @ with circles parallel to the bases of 
® into n equal parts. Next divide each of such 
circles into m equal parts as shown in Fig. 5.7. 
Depicted in the figure is also a polyhedron ®,,_, 
inscribed into ®. With any m fixed. the area 
of that polyhedron {D,,, clearly exceeds by a 
factor of n that of the projection of the polvhedron onto the plane of the cylin- 
der base. Since the projection does not depend on n, by increasing n. with any 
m fixed, the area of the polyhedron ®,,, may be made arbitrarily large. 


5.2.2. Squarability of smooth surfaces. We shall prove the follow- 
ing theorem. 

Theorem 2. A smooth bounded complete two-sided surface without 
singular points is sguarable. 

Proof. Suppose asingle regular parametrization may be introduced 
on a surface @. In that case the radius vector r (./) of a variable 
point @ of the surface is the function r (u, v) of the class C1** given 
in some closed bounded domain © in the plane of variables u and uv. 
The partial derivatives r, and r, of r (u, v) are continuous vector 
functions independent of the choice of Cartesian coordinate system 


in space. The value co of | [{rz, rp} | du dv therefore is indepen- 


a 
dent of the choice of Cartesian coordinate system in space. We prove 
that @ is squarable and that its area is co. 


* H.A. Schwarz (1843-1921) is a German mathematician. 
** By thisit should be understood that each component of r (u, rj) belongs 
lo . 
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Let © be an arbitrary positive number taken to be fixed in further 
reasoning. Find for that ¢ >0 a number 6 > 0 proceeding from 
the following requirements: (1) any part @, of @) whose size is less 
than 6 is projected uniquely onto the tangential plane at any point 
of the part (D,; (2) the cosine of the angle y between the unit normal 
vectors at any two points of the part @, may be represented by 


cosy = 1 — am,, (5.9) 


where [ ap, |<e/o and [am, |< 1. The possibility of such a 
choice of 6 > 0 is guaranteed by Lemmas 2 and 4 of Section 5.1.35. 

Consider an arbitrary subdivision of ® by means of piecewise 
smooth curves into a finite number of parts @, whose maximum size A 
is nol preater than 6. Since there is a single parametrization on @, 
corresponding to this subdivision of @ into parts @, is subdivision 
of a domain © into parts Q,;. On every part @, choose an arbitrary 
point 7; and denote by o, the area of the projection of a part @, 
onto the tangential plane at j/;. To compute a, proceed as follows. 
Choose a Cartesian coordinate system so that its origin coincides 
with /,;, the Oz axis is directed along the normal vector to the sur- 
face at ./, and the Oz and Oy axes are in the tangential plane. In 
our coordinate system the surface is defined by the parametric equa- 
tions 2 = xr(u, v), y = y(u, v),s = s(u, v), and the vector (ry, rel 
has the coordinates {4, B, C}, where 


Gu mu 


Yo Sp 


mu Ly Zu Gu 


Zo Ve 


A= , B= , C= , (5.10) 


“cr te 


Note that for the points of a part @,, in view of the choice of 6 and 
the orientation of the Oz axis, the number C is positive, C > U. 
Also note that the cosine of the angle ys; between the normal at 
a point 1 of @, and the Oz axis is 


© 
U{rure} i * 


(5.41) 


COS VM — 


}t is clear that yy, is the angle between the normal]s at the points Af 
and .V/, of the part @, and therefore representation (3.9) is true for it. 


We now turn to the integral | | V(rur,) | du dv which is clearly 
“nt 


f 2 «6 
independent of the choice of Cartesian coordinates in space. Using 
the positivity of C and the third of the formulas (5.10) we get 


Lirare] [dude = | | {ert i‘ is du dv. (5.142) 
$2, Q; Fe Ue “tr oofp 
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Applying to the integral on the right of (5.12) the first mean value 
formula in generalized form we have 


\ Z (z, = 49 
[ {I frure) laude (Frets ([SE laud, 6.18) 
Q; Zp uel Jar BE 
where 17 is some point of ;,. 


Since 


{{rurol | _ 1 
Zu Vu COS Yur 
Zp Up Rf 


{see (5.10) and (5.11)), and \) te ; 


dudv=o;, from _ for- 


mula (5.13) and representation: (5.9) for cos yy, we find that 


o:= | \ [Irure | due dv— || co, l{rutp] | du dv. (5.14) 
‘9? 2) 
Adding up equations (9.14) for all parts @, and considering that 
> | | red du do = = \ t trurll du dv=6 we get 
t “2 


Banc—F | te, | (rure] | du dv. (5.45) 


Evaluate the last term on the right of (5.15). We have 


>} |) as (urd [au do] <>, | [f 10, rare) Lae ae < 
<32 \ f [rely] | dudv=—. O=E. 


From this and from equation (5.15) we get 
|So,—o|<e. 
3 


Thus the surface @ is squarable and its area equals o. 

We have considered the case where it is possible to introduce 
a single parametrization on @. In the general case D may be divided 
into a finite number of parts, each allowing a single parametrization 


* We have used the formula for the area of a plane domain in transforming 
from coordinates (z, uv) to coordinates (u, v) using the relations 2 = x (u, v), 
y=u(u, v). 
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to be introduced*. After that the area of the surface may be defined 
as the sum of the areas of those parts. This completes the proof of 
the theorem. 

Remark 1. Let the surface @ be piecewise smooth, i.e. made up 
of a finite number of smooth bounded complete two-sided surfaces. 
Tho surface @ is clearly squarable, its area may be defined as the 
sum of the areas of the constituent surfaces. 

Remark 2. Jn the process of proving Theorem 5.2 we have estab- 
lished that if it is possible to introduce a single parametrization 
on @ and if the domain of the radius vector x (u, v) of @ is a closed 
bounded domain Q in the plane (uv, v), then the area o of @ can be 
found from the formula 


o= \ \ [ [rune] | du dv. (5.16) 
a 
If2=a2x(u, v), y = y (u,v), 2 = 5 (u, v) are parametric equations 
of the surface, then the vector [r,r,] has the coordinates {A, B, C} 
defined by rolations (5.10). Since |[ryr,] | = A? + BE + C3, 
formula (5.46) may be written as 


o= || VEER EC dua. (5.17) 


i 


Using the symbols 

ri=E, ry ly =F, r5=G 
and the formula 

lL [rury) | = V rirt—(rury)*, 
expression (5.16) for the aren of a surface may also be written as 


= | V EG — F* du dv. 0.18) 


a 


Remark 3. The area of a surface has the additive property: if 
a surface @ is divided by a piecewise smooth curve into parts D, and W, 
having no interior points tn common, then the area a of @ Is equal 
to the sum 0, + Gy of the arcas of M, and @.. This property follows 
from the representation of area using the integral and from the addi- 
tive property of the integral. 


* We may use for example Lemmn 3 of Section 5.1.5. By this lemma @ can be 
divided into a finite number of parts, cach of which is 1-1 projected onto some 
coordinate plane and thus is the graph of a differentiable function. 
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5.3. SURFACE INTEGRALS 


3.0.1. Surface integrals of the first and the second kind. Let ® be 
a smooth bounded complete two-sided surface. Let a function 7 (1) 
of a point j7 be given on M. Denote by rn (./) a continuous vector 
field of unit normals to @. 

We divide M with piecewise smooth curves into parts ©, and 
choose on each of such parts an arbitrary point 4J;. We introduce the 
following notation: A is the maximum size of the parts @,, oc; is 
the area of M;. X:, Y;, Z; are the angles a vector » (i/;) makes 
with the coordinate axes. 

Form the following four sums: 


I{®,, j= > (MI) o1, (5.49) 
I {®,, 41;, Z;};= ~ j (AT;) cos Z;0;, (5.20) 
1{®;, My, Y= 2 f(37;) cos Yya;, (8.24) 
I{®;, M;, X;}= Sf (41;) cos Xq0;. (5.22) 


3 


For each of these sums we introduce the concept oi limit as A > 0- 
We formulate this concept for the sums (3.19). For the sums (3.29); 
(5.24), and (5.22) the notion of limit is formulated in a similar way- 

Definition. The number I is said to be the limit of a sum /{®,, -i7;} 
as A —OQif given anye > 0 we can find§ > 0 such that for any subdi- 
visions of the surjace M with piecewise smooth curves into a finite number 
of parts D, whose mazimum size A is less than 4, regardless of the choice 
of points Af; on parts D; we have 


[l{D;, A i}—I | <e. 
The limit J of asum /{@;, -1f;} as A > 0 is called a surface inte- 
gral of the first kind of the junction 7 (\J) over ® and designated 
T= \\ ¢(anjac- (5.23) 
‘o 
If (z, y, z) are the coordinates of a point 17 on @, then for f (1) 


we may use the symbol f (z, y, s). In that case formula (3.23) may 
be written as 


I= { 5) i (z, y. 2) de. (5.24) 


The limits of the sums /{@;, M;, Z;}, [{M;, \;, Y;}. and 
T{M;, 3f;, X;} as A —O are called surjace integrals of the second 
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hind of the function fj (Af) over @. For these integrals we use the fol- 
lowing symbols respectively: 


| ¢NcosZeo, | [7 Noos¥ do, || f(aljcosX ao 


® es 8 * e 
@ md @ 


or symbols similar to (5.24). 

Remark 1. The definition of the surface integral of the first kind 
implies the independence of the integral from the choice of orientation 
of the vector field of unit normals to the surface or, aS we say, fram 
the choice of side of the surface. 

Remark 2. The surface integral of the second kind depends on the 
choice of side of ihe surface: reversing the orientation of the vector 
field of unit normals reverses the sign of all the three surface inte- 
grals of the second kind. This is due to the fact that in each of the 
sums (3.20), (5.21), and (5.22) the values of f (\7;) and o, remain 
unchanged under a change of orientation, and the values of the co- 
sines of the angles the normal r (.\f,) makes with the coordinate axes 
change sign. 

Remark 3. Once a definite side of the surface is chosen, the surface 
integrals of the second kind may obviously be regarded as the surface 
integrals of the first kind over @ of the functions f (AJ) cos Z (47), 
{ O07) cos Y (MM), f (31) cos X (1) respectively. Indeed, once a defi- 
nite side of the surface is chosen, cosZ, cos Y, cos X are the functions 
of the point Jf of @. 

5.3.2. Existence of surface integrals of the first and the second kind. 
Let (@ be a surface satisfying the conditions stated at the beginning 
of Section 5.3.1. Choose on @ a definite side. By Remark 3 of Sec- 
tion 3.3.1, once a definite side of @ is chosen surface integrals of the 
second kind may he regarded as integrals of the first kind. Therefore 
we shall state sufficient conditions for existence only for the integrals 
of the first kind. 

The following theorem is true. 

Theorem 3.3. Let it be possible to introduce on a surface D a single 
parametrization by the function 


xa=az(u, v) yrey(u, vj), c= zu, v) (4.25) 
given in a bounded closed domain Q of the plane (u, v) and belonging 


to the class C' in that domain. If the function f (MM) = f (z, u, 2) 
is continuous on (D*, then the surface integral of the first kind of that 


* The concept of continuity of the function of a point Jf given on some set 
{3/} in space was formulated in Section 14.3.4 of [4]. In the care under con- 
sideration the role of {37} ic played by @. 
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function over MD exists and can be evaluated from the formula 


I= j f(M) do= 


= \ \ f(x(u, v), y (u, v), z(u, v)) V EG — F? du dv*. (5.26) 
2 


Proof. We want to prove that given any e > 0 we can find 6 > 0 
such that for any subdivision of ® with piecewise smooth curves 
into a finite number of parts ®; for which A < 46, regardless of the 
choice of points 47; on the parts @; we have 


|T{®, Mj}— 


~{{ Few. (u,v), 2, 2) VEC—Pduav|<e. (5.27) 
“9 


Let ¢ be any fixed positive number. Choose for this e > 0 a number 
6* > 0 so that the following two conditions should hold: 


(1) For any two points (u;, v;) and (u;, v,;) of Q a distance less 
than 6* apart, 


\VE (Wy, 01) G (U4, 01) — F? (U4, ¥4)— 

-VE (uy, Gy, )—F (0) |< aap (5.28) 
where A is a positive number exceeding the maximum of the function 
| f (4) | and P is the area of Q; 

(2) For any subdivision of Q with piecewise curves into a finite 


number of parts 2, whose size is less than 6* and for any choice of 
points (u;, v;) in every part Q; 


| » f(z (te, Vi)s y (Uj, V4); z (uz, V1) x 


x V Ey, vi) G (Uy, v,)—? (wz, vs) oF — 
—~ | \ fiz, v), y(u, v), 2(u, v) VEGF dudv|<z, (5.29) 
J : 


where oj are the areas of parts ;. 
The possibility of the required choice of 6* is guaranteed by the 


property of uniform continuity of the function VEG — F? continuous 


* f(x(u, v), y(u, v), 2(u, v)) is the function obtained by superposing 
f(y, 2) and z= 2 (u, v), y= uy, vs s =: (u, v). By the theorem on the 
continuity of a complex function this function is continuous in 2. 
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in the bounded closed domain Q and by the property of integrability 
of the function f (z (u,v), y (u, v), 2 (u, v)) VEG — F? continuous 
in Q, 

We find 6 > 0 for 0* > 0 so that any subdivision of © with 
piecewise smooth curves into a finite number of parts ©, whose 
sizes are Jess than 6 has a corresponding subdivision of Q into a finite 
number of parts Q; whose sizes are Jess than 6*. The possibility of 
a choice of such 6 is guaranteed by the fact that @ is a homeomorphic 
mapping of 2 and therefore corresponding to each subdivision of @ 
with piecewise smooth curves into a finite number of parts W, is a 
subdivision of 2 with piecewise smooth curves into a finite number 
of{parts Q,. If the maximum size of the parts @, tends to zero, then 
so,does the maximum sizo of the parts Q,. 

Now consider subdivision of @ with piecewise smooth curves into 
a finite number of parts @, whose maximum size A satishes A < 6, 
whero 5 > 0 is chosen for 6* in the way shown above. Form for this 
subdivision a sum /{W,, 4/,} using (5.19). Since the area o, of 


nm part qd), is { VEG — F2 du dv, donoting the coordinates of 
‘ot 
a point AJ, in the part M, by (x (uy, Us), Y (Ur, Uy), F (tty, ¥,)) We get 
f(D, AL,) = 


= »> ] (z (tly, Us), Y (iy, v1), = (ui, v;)) \ \ VEG — F? du dv. 
i Q; 


Using the mean value theorem for the integrals on the right of the 
last relation we clearly may transform that relation as follows: 


1{M,, AL,}— | f(x (u,v), Qt, v), (u,v) V BG— PF? du dv= 
"Q 


= [ y f(z (ty, V4), i] (22, u'r), (uy, vr)) “A 


” }- E (uy, v1)G (Us, U,)— Fe (uy, v,) oi — 


— | | f(z(u,v),y (u,v), s(u, v)) VEG— FP du dv |+ 
‘a 


+ DSA x (Uz, Vi), y (tty, Ur), o (tty, V;)) rae 


™ L/ E (Uy, U1) G (tay, VY) — FA (uy, Uy) — 


_ 0 E (uy, VU) G (uy, Uy) — Fe (uy, v1) of. 
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From the last equation, via inequalities (5.28) and (5.29), we easily 
obtain inequality (5.27). The proof of the theorem is complete. 
Remark 1. To compute the surface integral of the second kind 


| f(z, y, 2) cos Z do once a definite side of M is chosen, we can 
“® 
clearly use the following formula: 


[ \ f(x,y, z)cosZdo= 
"® 


a een | 


= \ \ f(x(u, v), y (u,v), z(u,v) VY EG—FrcosZdudv. (5.30) 


2 


Similar formulas are true for the other two surface integrals of the 
second kind. 

Remark 2. Let a surface @ be the graph of the function z= 
=z(z, y) belonging to its domain of definition, D, to the class C}. 
Choose on ® the side for which the unit normal vector rn (AZ) 
of ® makes with the Oz axis an acute angle. In that case cosZ= 
a ae 

Vi+P?+¢ 
nuous function R(z, y, z) is given. Considering then that z and 
y are taken to be the parameters uw and v on ® ( is defined 
by the parametric equations z=z, y=y, Z2=2(z, y)) and 
V £G— F2=V1-+ p?+¢?, we may rewrite formula (5.30) as 


Oz a : 
, where p=, q=FC- Suppose on @ a conti- 


R(x, y, 2) cos Z do= | Ria, y, 2(2, y)) Vi+ p?+ qx 
a 68 D 


ra) 


1 
x Vinee UU j R (x, y, 2 (x, y)) dz dy. 


This remark explains why we have the following notation for the 
surface integral of the second kind: 


\ R(z, y, 2) cos Z do= \ R(z, y, 2) dzdy. (5.34) 


Note that (5.34) is used also in the case where @ is not the graph of 
the function z = 2 (z, y). 
We shall consider surface integrals of the second kind having the 


following form: 


| \ (P cos X + Qcos Y + R cos Z) da. 
“o , 
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Such integrals will also be designated as follows: 


| | Pdydz+Qdzdz+ Rdzdy. 
‘ 


Remark 3. The concepts of surface integrals of the first and the 
second kind can naturally be extended to the case where @ is a picce- 
wise smooth surface. For such surfaces clearly the existence theorem 
proved in this subsection is also true. 

9.0.0. surface integrals of the second kind independent of the 
choice of Cartesian coordinate system. From the definition of the 
surface integrals of the first and the second kind it follows that the 
integral of the first kind is independent of the choice of Cartesian 
coordinate system in space, whereas the integrals of the serond 
kind are dependent of its choice, for changing the coordinate system 
changes the values of the cosines of the angles which the normal rn (VV) 
makes with the coordinate axes. 

In the case where a vector function is given on the surface we can 
show a more general approach to the notion of surface integral of the 
second kind which allows us to speak in a sense of the independence 
of the value of the integral of the choice of Cartesian coordinate 
system In space. 

So Jet @ be a smooth bounded complete two-sided surface on which 
a continuous vector function r (3) is given. Choose a definite side 
on () and denote by xv (A/) the vector field of unit normals to (. 

Obviously the scalar product r (AT) n (47) is a continuous scalar 
function given on (DP) and is therefore independent of the choice of 
Cartesian coordinate system in space. Consequently, the surfaco 
integral of the first kind of that function 


\ (Al) 2 (AT) do 

“ow 

is independent of the choice of Cartesian coordinate system in space. 
Use the coordinate notation of the scalar product r (J/) n (AM), 


assuming that the vector r (\7) has the coordinates P, QO, 2. Since 
the coordinates of n (AJ) are cos X, cos Y, cos Z, we have 


r(V) nV) = PcosX + QcosY + ReosZ 


and therefore 


| r(M)n(M) do == | { (P cos X -+-Qcos Y +- Reos Z) do. 
oe . 


~ 


3 @) 


The integral on the right of the Jast equation is the sum of the three 
surface integrals of the second kind and is usually called the general 
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surface integral of the second kind. Consequently, \ [ r (Af) n (M4) do 


© 
may also be called the general surface integral of the second kind. 
Remark 41. If on the surface @ the three scalar functions P, Q, 


and & are given, the integral [ \ (P cos X + Qcos Y+R cos Z) do 


® 
may be written in the form invariant (independent) of coordinate 
system, assuming P, Q, and # to be the coordinates of some vector 
function r (JZ) given on the surface and writing the integral in the 


form [ \ r (M) n (AM) do. Note that thereby we impose a certain 


Jaw of transforming the integrand in transition to a new Cartesian 
coordinate system. In this case we obtain new coordinates of the 
vector r (7) which are calculated by the well-known rule of analytic 
geometry. However, this invariant form of notation for the surface 
integral is very conveniently used in various applications. 
Remark 2. Note that the general surface integral of the second 


kind ( ) r (M1) n (MW) do is numerically equal to the quantity 


® 
called by physicists the flux of the vector r (17) through the surface @. 


CHAPTER 6 


BASIC FIELD THEORY OPERATIONS 


This chapter considers scalar and vector fields. It investigates basic 
operations of field theory. 


6.4. TRANSFORMATIONS OF BASES AND COORDINATES. 
INVARIANTS 


6.1.1. Conjugate vector bases. Covarian{ and contravariant coor- 
dinates of vectors. Lot ry, § = 1, 2, 3, be a vector basis of a threo- 


dimensional space* (for the plane, i assumes values of 1 and 2), 
The basisr", k = 1, 2, 3, is said to be conjugate to the basis r, if** 


1, i=h, 
0, issih, 
The symbol 6% is called the Kronecker symbol***, 


The question arises as to the existence and uniqueness of the conju- 
gate basis. The answer to this question is yes: for a given basis r; 
there exists a unique conjugate basis r*. 

We shall see for example that a vector r' is uniquely determined. 
According to (6.1) it is orthogonal to the vectors r, and rs. This 
uniquely determines the line of action of r!, Next from the condition 
ryrt = 1 the vector r! itself is uniquely determinod. Similarly, the 
vectors 7° and r? are uniquely constructed. To show that r', r*, r° 
form a basis, it suffices to prove that r'r*r?7 =~ 0. By the thearem 
on the product of determinants 


rim = St =| l, k= 1, 2, 3. (6.1) 


ryt ry? yr 100 
ryrora) (riptr3) ==t ror) rer? rare l=10 4 Ol 14. 6.2 
ols 2 9 2 

rar! rgr® rgrs 00 4 


* Recall that r;, re, rs form a basis if they are noncoplanar, I.c. if their 
triple product ryr.r, is nonzero. 

** Throughout this chapter ab denotes the scalar product of vectors a and b, 
ave denotes the triple product of vectorsa, B, and c, and [ab] symbolizes the vect- 
or product of a and b. 

*** Leopold Kronecker (1823-1891) is a German mathematician. 
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Since rrr; ~ 0 (r,, re, rs form a basis), it follows from (6.2) that 
riy’*r3 — 0 either. 

Remark {. If a basis r; is orthonormal, then the conjugate basis 
rk coincides with the given hasis r;. 

It is easy to see that the vectors r® of the conjugate basis in three- 
dimensional space can be found using the relations 


i — [ Pats] 2 [737°] r3 = [7-172] 
ryPol's ’ ryPels 7 Tots ° 
Let r;, r® be conjugate bases, and let x be an arbitrary vector. 
Expanding x with respect to the basis vectors we get 


e=a2r't+ nr? t-agr, c= ziry tore t+ zr. (6.3) 


The numbers 2,, 22, 3 are called covariant coordinates of the 
vector z, and z!, x*, z° are the contravariant coordinates of x. These 
terms are made clear in Section 6.1.2. 

To abbreviate the notation of formulas containing terms of the 
same type (relations (6.3) may serve as an example of such formulas), 
in what follows we shall employ the following convention. Suppose 
there is an expression made up of multipliers. If the expression has 
two identical literal indices, one superscript and the other subscript, 
it is considered that summation is taken with respect to these indi- 
ces: the indices are successively assigned the values of 1, 2, 3 and 
the resulting terms are then added up. For example, 


aris ariter?+aor8, 6)=6'+ 82+ 83, 
Bipvic® = (24,012) + (8o,27x*) + (B9,292") = 
= (84,0'x! + Byox'x* + gygx'x) + 
+ (o,t7x! + Soor*x? + gogx*x?) 4- (85,232! + goor3x2-+ gy,25X%). 


Using the summation convention formulas (6.3) can be written 
in the following compact manner: 


eax, xc=2'r;. (6.4) 


Remark 2. The superscripts and subscripts mentioned in the 
summation convention are usually referred to as dummy indices. 
It is clear that dummy indices may be denoted by any letters, the 
expression they occur in remaining unaffected. For instance, 2;r 
and x,7°* represent the same expression. 

Remark 3. All arguments in this subsection referred to the case 
of three-space. In the two-dimensional case the literal indices take 
the values of 1 and 2. 

We obtain an expression for the covariant and contravariant 
coordinates of the vectors. To do this write the scalar product of 
the first of the equations (6.4) by r; and that of the second by r*- 


10—01684 
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Taking then into account relations (6.4) we find that 

xr, = 2, (rir,) = 2,8, = 24, 

art = 2h ry") = 2l6f =o" 
So 

g,=ar, z=ar', (6.5) 
Using relations (6.5) we write formulas (6.4) as 

e=(rr,)ri, x=(ar')r;,. (6.6) 


Relations (6.6) are called the Gibbs* formulas. We again turn to 
the question of constructing conjugate bases. 
Using formulas (6.6) we get 


ri=(r’ri) ry, or, =(ryr))r'. (6.7) 
Denoting 

pe ps hi hyd 6.8 

SxtH Try, FEF Hrr, (6.8) 


we rewrite relations (6.7) as 
h_ jhi 
ma gry y= Baar. (6.9) 


So it suffices to know the matrix (g**) to construct the basis r* from 
r;, and it suffices to know the matrix (g,;) {0 construct 7, from 7". 
We prove that these matrices are reciprocal. To do this we write the 
scalar product of the first of the equations (6.9) by r,. Taking into 
account relations (6.4) we get 


1, j=k, 
0, jstk. 
These relations show that (g*) and (g,;) are reciprocal. Since the 
elements of the reciprocal matrix can be calculated in terms of those 


of a given matrix, it is clear that using relations (6.9) solves the 
question of constructing conjugate bases. 


} 


6.1.2. Transformations of a basis and transformations of coordinates. 
Let r; and r', i = 1, 2, 3, be conjugate bases, and Jet ry and r* he 
new conjugate bases. 

Using the summation convention we wrife formulas for transfor- 
mation of basis vectors. We have: 

(14) formulas for transition from the basis r, to a new basis ry 
and reverse formulas: 


rye bers, ry = bi ry, i, i= 4, 2, 9; (6.40) 


* Josiah Willard Gibbs (1839-1903) is an American theoretical physicist. 
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(2) formulas for transition from the basis r’ to a new basis r*’ and 
reverse formulas 


rvabiri, riabbri’, i, iv =1, 2, 3. (6.44) 


Since the transformations (6.10) are reciprocal, so are the matrices 
(6j-), and (0; ). For similar reasons (bj) and (bj-) are also reciprocal. 

We prove that (bj-) and (bj) coincide. This will prove that so do 
(bY) and (b}). To begin with, we obtain the scalar product of the 
first of the equations (6.40) by r* and that of the second of the equa- 
tions (6.14) by r,-. Taking then into account relations (6.4) we find 

ryer® = bb (ryr*) = b}-68 = bp, 

ripg = bb (rity) = bbb = he. 


From these relations we get 


be=ryr', (6.42) 
bi. = ryr'. (6.13) 


Since the right-hand sides of relations (6.12) and (6.13) are equal, 
so are the left-hand sides. In other words })- = 0} and this just 
means that the matrices (bi-) and (0}-) coincide. Note that the ele- 
ments b;- of (bj) can be calculated from formulas (6.12). 

We may now say that to change from a basis r;, r' to rj, ri it is 
sufficient to know only the matrix (b3-) of transition from the basis 
r; to r;- (the matrix (b}) is calculated from (b}-)). 

Here is a complete list of formulas for transformations of basis 
vectors: 


. 
ree ber; ry Op ry, 
i’ 7’ ? 7 i i” (6.14) 
rv=bsr*, r=bpri. 
We proceed to derive formulas for transforming the coordinates of 


a vector when changing to a new basis. 

Let z;- be the covariant coordinates of x in the basis r,, r*’. 
According to (6.5) we then have 

wy Lr;-. 
Substituting into the right-hand side of this relation the expression 
for r;- from formulas (6.14) we find 

Ly = @ (bier ;) = bie (xr;) = bier;. 


40e 
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So formulas for transforming the covariant coordinates of a vector 
in changing to a new basis are the form 


xy = Vier). (6.14) 


We see that in changing to a new basis the covariant coordinates of 


the vector 2 are transformed using the matrix (bj-) of direct chance 
from the old basis to a new one. This concordance of transformations 
explains the term “covariant* coordinates of a vector’. Substituting 
into the right-hand side of zi’ = art’ the expression for rt in (6.14) 
we obtain the following formulas after transformations: 


x’ — birt, (6.45) 


We see that in changing to a new basis the contravariant coordinates 


of x are transformed using the matrix (bj ) of inversion from the new 
basis to the old one. This discordance of transformations explains 
why the term “contravariant** coordinates of a vector” is used. 

6.1.3. Invariants of a linear operator. The divergence and curl 
of a linear operator. Jnvariants are expressions independent of the 
choice of basis. For instance, the value of the scalar function ata 
given point is an invariant. An invariant is an object vector inde- 
pendent of the choice of basis. A scalar product of vectors is also an 
invariant. 

Here we shall discuss some invariants of a linear operator. Let A 
be an arbitrary Jinear operator defined on vectors of a three-dimen- 
sional Fuclidean space (i.e. A (ax + By) = ale + fpAy for any 
vectors z and y and any real numbers a and fp). We prove that the 
expression 


rir, sr, Ar *2% (6.10) 


is an invariant. 
We want to prove that in changing to another basis r,-, r” 


ridr,=rl Ary. (6.17) 


Let rz, rr! he a new basis and Iet (bi) be the matrix of transition 
from the basis r,. rr tory, rv’. We have 


, 
ry=bl ry, ris bir’, 


* Covariant means concordantly chancing. 

** Contravariant means changing in an opposite way. 
*** The validity of the equation riAr,;-sr;Art can be seen from the follow- 
ing. According to (6.9) rl = gikr,, r, = grt. Considering that the matrices 
(gih) and (¢;;) are reciprocal and symmetrical we therefore get 


rlAr, = ¢'heqr Art br, Arl=r,Ark=r;,Arl, 


Ch. 6. Basic Field Theory Operations 449 


Substituting these values for r; and r’ in the expression ri‘Ar; we 
get 


ri Ar, = (bj by) rr’ Ary. (6.18) 
Since (bi bie) = 6}, from (6.18) we get 

r Ar, = 6p Ary =r Ary. 
Equation (6.17) is thus proved, and hence so is the invariance of the 
expression r‘Ar;. 

The invariant r‘Ar; of a linear operator A will be called the 
divergence of the operator and designated div A. Thus 

div A=riAr,=r;,Ar'. - (6.19) 


Remark. In a given basis rj, r* a linear operator may be given 
using a matrix called the matrix of the linear operator. It is the mat- 
rix of the coefficients a? of expansion of vectors Ar; with respect 


to the basis r; (it is possible of course to consider the matrix of the 
coefficients of expansion of Ar? with respect to r*): 

Ar,;=ajir,; ai=r’Ar;. (6.20) 
The divergence of the matrix of A may be expressed in terms of the 
elements of the matrix (az). Namely 

div A=aj=ai+a?+ai. (6.24) 


To see that formula (6.21) is valid it is sufficient to substitute the 
expression (6.20) for Ar; in the expression (6.19) for divergence and 
use the relation r’r, = 64. 


We prove that the expression 

[r; Ar‘) = [rt Ar,]* (6.22) 
is also an invariant. We must prove that in changing from the basis 
r;, ri to another basis r;-, rv 

(r,Ar*] = [ry-Ar?’]. (6.23) 
Let rj, r#’ be a new basis and let (bj,) be the matrix of transition 
from the basis r;, r to ry, r’. We have 

r,=biry, ris Dipr*’, 

* The validity of {r;Ar‘] = [r#Ar;] can be seen from the following. According 


to (6.9) rt = gtkr, rr; = g;)r!. ant the reciprocity and symmetry “of (gtk) and 
(g3;) we therefore "get 


[r? Arj] = gtkg;, [r pAr!] = = = 6} [rp Ar) =[r, Ark} = [r; Ari], 
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Substituting these values for r; and rt in [r,Ar'] we get 


[rpeir'] <= (b, be) (re Ar”’). (6.24) 
Since (61 b!,) = Of, from (6.24) we get 


re Arty x Ope [rye Ar!’ ] = [rye Art’). 


Wquation (6.23) is thus proved, and hence so is the invariance of 
r;Ar'}. 

The invariant [r,Ar‘] of a linear operator A will be called the curl 
of the operator and designated curl A. Thus 


curl A= [ryAr') = fr, Ar!) + [rpAr?] + [r Ar’). (6.25) 

We show the expression for the divergence and curl of the linear 
operator -1 for the case of the orthonormal basis ¢t, J, Ik. Since in 
this case the conjugate basis coincides with the given basis, by for- 
mulas (6.20) the elements a,; of the matrix of A can be found from 
the formulas 

Ay, TAT, Ay. TAJ,  04,== tAk, 

Qa, == JA, lon = JAS, Qn, == JAK, (6.26) 

Os = Ai, Asn = kA, Q43 = kAls, 


(in contrast to the general case we have denoted the elements of the 
matrix of A by am; instead of a7). 


for the divergence of A we obtain the following expression: 


e= 


3 
div d= ~, ip = Oy, + og + O33 = TAL + JAZ + LA, (6.27) 
We find the oxpression for the curl of A. Since in the case of the ortho- 
normal basis conjugate bases coincide, from (6.25) we get 
cur] A = [Ai] + [jAj] + [rAk). (6.28) 


We ovaluate the first vector product [zAz). Sinco Ai = a,,t + ao) + 
-+- ay,k, we have 


{?At}) = ay, lit] + a., [iy] + a5, ik] = —aaj + aah. 
Quite similarly we obtain the formulas 
[FAT] = gt —ayahk, [Ake] = — dyyf + agg]. 
Usine these and relations (6.28) for curl A we find 


curl A -- (dg. — Gog) § 4 (Qyg — 83x) J He (@1 — O13) F- (6,28) 
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6.2. THE SCALAR FIELD AND THE VECTOR FIELD. 
BASIC CONCEPTS AND OPERATIONS 


6.2.1. The scalar field and the vector field. Let 22 be a domain in 
the plane or in space. 

A scalar field is said to be given in Q if associated with each point 
M of 2 by a certain law is some number u (MM). 

Note that the concepts of scalar field and function defined in Q 
coincide. The following terminology is commonly employed: a@ scalar 
field is given using a function u (M). 

The notion of vector field is introduced quite similarly: if associat- 
ed with each point M in Q by a certain law is some vector p (M), then 
a vector field is said to be given in 2. We shall employ the terminology: 
a vector field is given using a vector function p (M). 

The temperature field inside a heated body, the field of mass 
density are examples of scalar fields. The velocity field of a steady- 
state liquid flow, magnetic intensity field are examples of vector 
fields. 

6.2.2. Differentiable scalar fields. The gradient of a scalar field. 
The directional derivative. We have already noted that the notions 
of scalar field wu (7) in a domain & and function defined in that field 
coincide. We may therefore define the differentiability of a scalar 
field as the differentiability of the function giving that field. For 
convenience we shall formulate the notion of differentiability of the 
field employing terminology somewhat different from the usual 
terminology. 

A linear form f (Ar) in a vector Ar is the scalar product of that 
vector by some Ar-independent vector g. We shall also use the nota- 
tion: 

0 =o (iM, M’) is the distance between points Jf and M’, 

Ar = MM' is the vector connecting Mf and M', 

Au = u (M’) — u (M) is the increment of the field at . 

We give the following definition. 

Definition 1. A scalar field u(M) is said to be differentiable at 
a point M of a domain Q if the increment of the field Au at M may 
be represented in the following form: 


Au = f (Ar) + 0 (p), (6.30) 


where f (Ar) is a linear form in a vector Ar. 

Relation (6.30) will be called the differentiability. condition of the 
field w(M) at a point M. 

Remark 1. Since a linear form f (Ar) is a scalar product g-Ar, 
where g is a Ar-independent vector, the differentiability condition 
of wu (M) at AT may be written as 


Au = g-Ar + 0 (0). (6.31) 
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We prove that if a scalar field u (1/) is differentiable at a point JV, 
then the representation (6.30) (or (6.31)) for the increment Au of 
that field at A/ is unique. Let 

Au = g-Ar + 0, (p) and Au = h-Ar + 04 (p) (6.32) 


be two representations of the increment Au at Jf. From formulas 
(6.32) for Ar =£0 we obtain the relation 


g—he=—e, (6.33) 
where c= is the unit vector and o(p)=o0.(p)—0o,(p). Since 
24) 2() is on infinitesimal as o9—>0Q, it follows from (6.33) 


}Ar] 
that (g—h)e=0 for any e, i.c. g=h. The uniqueness of the 
representation (6.30) is thus proved. 

We shall say that ascalar field u (AL) given in a domain Q is differ- 
entiable in Q if it is differentiable at each point of Q. 

Definition 2. The gradient ata point Al of a scalar field u (MM) 
differentiable at that point is a vector g defined by relation (6.3)). 

The gradient of a scalar field is designated grad u. 

Remark 2. The above definition of the differentiability of a scalar 
field is convenient in that it has an invariant character independent 
of the choice of coordinate system. The gradient of a scalar field is 
therefore an invariant of that field. 

Remark 3. Note the following important fact: if a scalar field 
u (AI) given in a domain Q is differentiable in that _fomain, then the 
gradient grad u of the field is defined at each point of 2 and is obviously 
a vector field given in Q. 

Remark 4. For the scalar field we introduce the concept of level 
surface (level curve for the plane field) which is a set of points on 
which the values of the field u (A/) are the same. The gradient of 
the field at a point JJ is orthogonal to the Jevel surface at that 
point. The reader will easily see for himself that this remark is 
valid. 

Using the symbol grad u for the gradient of a scalar field we rewrite 
relation (6.34) in the following form: 


Au = grad u- Ar + 0 (p). (6.34) 


Note that the term grad u-Ar is usually called the differential du 
of the scalar field. Thus 


du = grad u-Ar. (6,35) 


Let us agree to employ the term differential dr for the increment 
Ar of the radius vector r = OM, Ar = OA’ ~— OST. Formula (6.33) 
for the differential du of a scalar field may be written as 


du = grad u-dr. (6.36) 
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Let two differentiable fields u (7) and v (JM) be given in 9. The 
following relations are true: 


grad (u-- v)= grad u + grad v, 
grad (uv) =u grad v-+v grad u, (6.37) 
grad oo (for v =£ 0). 


pe 
Ili F is a differentiable function, then 
grad F (u) = F’ (uw) grad u. (6.38) 


The derivations of formulas (6.37) and (6.38) are of the same type. 
As an illustration, we show the validity of the second of the formu- 
las (6.37). Using formula (6.34) and the continuity of the function 
u (M) we have 


A (uv) =u (M')v (M"')—u (M1) vo (M) = 
=u (M’)Av+uv (M1) Au= 
=(u (M) grad v+ v (Jf) gradu) Ar +0 (p). 


From these relations it follows that the increment A (uv) may be 
represented by the form (6.31). Therefore uv is a differentiable func- 
tion and grad (uv) = wu grad v + v grad u. The second of the formu- 
las (6.37) is thus proved. 

We introduce the concept of directional derivative for the scalar 
field. 

Let a field u (47) be given in Q, let Af be some point of 2, and 
let e be the unit vector indicating the direction at M7. Also suppose 
that M’ is any point in 9 other than M and such that the vector 
MM’ is collinear with e. The distance between 47 and M”’ is denot- 
ed by p. 

If there is a limit 


lim Au 

p~0 
(Au = u (M’) — u (M)), then this limit is said to be the derivative 
of the field u at M with respect to the direction e and designated =. 
Thus 

ae) (6.39) 

The following statement is true. 

Let u (M) be differentiable at a point M. Then the derivative = of 
uat M with respect to any direction e exists and can be found from the 
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formula 


ou = grad ue. (6.40) 


Je 


We prove this statement. Let e be any fixed direction and Jet the 


point Jf‘ be taken so that the vector Ar = Jf4T' is collinoar with e. 
It is clear that Ar = pe. Substituting the value of Ar in rolation 
{6.34) we find 


Au = (grad u-e) p + 0 (p). 
From this we obtain the formula 


AY ce 240) 3.4 
3 grad u-e-+ > (6.41) 


Relations (6.39) and (6.41) yiold formula (6.40). This proves the 
statement. 

We find the expression for the gradient of a_ differentiable 
scalar field assuming that in space an orthonormal basis i, j, f 


is chosen referred to a Cartesian coordinate system Oxyz. Sinco 
. ree . ) 

grad ust (grad u-i) +7 (grad u-j) +h (grad u-k) aud = =, 

udu du a 

aj dy? Oko dz 


, using relations (6.40) wo ret 


- Ou 


- OU - OU 
erad u=U ar td Gy th 3: ° 


Using the exprossions (6.40) for the directional derivative gives 
the following graphical picture of distribution of the values of 
directional derivatives of the field 
w (Mf) ina plane domain Q at a given 
point Af. Let grad u 0 (if gradu =0, 
then it follows from (6.40) that 
S = 0 for any e). On the yector grad 
uas diameter (Fig. 6.1) wo construct 
a circle C. We also construct a circle C* 


as large as C and touching it at JZ. 


Fig. 6.1 Let e be an arbitrary direction. Draw 
through jf a half-line in tho direction 
of the vector e. IE the half-line touches C and C*, then a =() (the 


vector e is orthogonal to grad w). If, however, the half-lino intersects 


hoe . Bu. 
C or C* ina point N, then — is equal to the leneth of WN taken 
with the plus sign when V is on C and with the minus sign when is 
on C*, Pora field in space the ciecles C and C* must he roplaced 


with spheres. 
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6.2.3. Differentiable vector fields. The divergence and curl of a vector 
field. The directional derivative of a vector field. Let a vector field 
p (MM) be given in a domain Q of a three-dimensional Euclidean 
space. In what follows we use the notation: Ar = MM’, Ap = 
= p id") — p (M). 

We give the following definition. 

Definition 3. A vector field p (1M) is said to be differentiable at 
a point JT of Q if the increment of the field Ap at M may be represent- 
ed in the following form: 


Ap = AAr +0 (| Ar J), (6.42) 


where A is a linear operator independent of Ar (independent of the 
choice of point M’). 

Relation (6.42) will be called the differentiability conditions of 
a field p(M) at a point M. 

We prove that if p (J/) is differentiable at M, then the represen- 
tation (6.42) for the increment Ap of p(M) at AZ is unique. 

Let 


Ap = AAr + 0, (| Ar |) and Ap = BAr+o,({A4r]) (6.43) 
be two representations of Ap at Af. For Ar = 0 formulas (6.43) give 


(A—B)e= ae (6.44) 
where e= a is the unit vector, o(j/Ar|)=0,({Ar|)—~—o, ({Ar]). 
Since ee is an infinitesimal vector as Ar-»+0Q and e is an 


arbitrary unit vector, it follows from (6.44) that (A—B)e=0 
for any e, i.e. A=8&. This proves the uniqueness of the repre- 
sentation (6.42). 

We shall say that a vector field p (M) given in a domain Q is diffe- 
rentiable in that domain if it ts differentiable at each point of Q. 

We introduce the concept of directional derivative for a vector 
field p (MM). 

Let the field p (/) be given in a domain Q, let M be some point 
of 9, and let e be the unit vector indicating the direction at JM. 
Also suppose that M’ is any point in 2 other than M and such 
that the vector MM’ is collinear with e. The distance between M 
and M’ is denoted by op. 

If there is a limit 

lim AZ 

p->0 
(Ap = p (M’) — p (%)), then this limit is said to be the derivative 


of p (M) at M with respect to the direction e and designated 22 . 
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Thus 
LP jim SP, (6.45) 
JC n~-0 


The following statement is true. Let a field p (J/) be differentiable 
ata point A of Q. Then the derivative = of the field p at that point 
with respect to any direction e erists and can be found from the formula 


up TS 
Fe Ae, (6.46) 


where A isa linear operator defined by relation (6.42). 

We prove this statement. Let e be any fixed direction and Ict 
the point Af’ he taken so that the vector Ar = pe and | Ar | = p. 
Substituting this value of Ar in relation (6.42) and using the prop- 
erties of a linear operator we find 


Ap = pAe + 0 (p). 


From Oeis we obtain the formule 
ray 0 (p) ~ 
—f = Ae 4 p . (6.47) 
iP p 


Relations (6.45) and (6.47) yield formula (6.46). This proves the 
statement. 
Let p (M/) be a field differentiable at a point AJ of O. Then 


Ap = A&ar +o ({ Ar ]). 


We find the matrix of a linear operator A for the case of the ortho- 
normal basis f, J, &. We shall assume that this basis is referred to 
a Cartesian coordinate system Orys. 

We denote by P, Q and R the coordinates of p (V) in i, J, ke. 
Obviously by (6.46) 


at Or * 67 ay 
From these formulas and from relations (6.26) for the matrix of the 


coefficients of a linear operator ini, 7, kit follows that the matrix A 
of the operator A in question has the form 


vx dy bd 
* arg) ag dQ . 
NS Gr Gy Gs | (6.48) 
OR GR aR 
Or uV az 


We introduce the concept of divergence and curl of a vector field 
p (Af) differentiable in ©, i.e. of a field such that its increment Ap 
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at each point Jf of 922 may be represented as 
Ap = AAr+o(|{ Ar }), 


with A in general changing in going from point to point in 2. In 
other words, A is dependent on a point /7 and is of course indepen- 
dent of Ar. 

We shall call the divergence and curl of a linear operator A the diver- 
gence and curl of a field p(M) at a point M of a domain Q. Thus 
by definition 


div p = div A, curl p = curl A. (6.49) 


Remark. Under our hypotheses about the differentiability of the 
field p (Af) in Q div p and curl p are defined at each point of Q. 
Since these objects are invariants (independent of the choice of 
basis), clearly div p is a scalar field and curl p is a vector field 
in QQ. 

We find the expressions for the divergence, curl and directional 
derivative of a differentiable vector field p (M@) assuming that in 
space an orthonormal basis 7, j, /¢ is chosen referred to a Cartesian 
coordinate system Oxyz. We shall assume as above that p (J7) has 
coordinates P, Q, R in i, J, k. 


Since the matrix A of a linear operator A is defined by relation 
(6.48) in the case under consideration and by definition div p = 
= div A, curl p = curl A (see (6.49)), from formulas (6.27) and 
(6.29) we Bet 


div p= — as a. 4. 22 ee al (6.50) 
onl p= ($22) 14(S—98) 74 ($—AE) (051 


To compute the derivative of p (A7) with respect to e we use formula 
(6.46) and the properties of the linear operator. 
Let e = icosa+jcosfh + kcosy*. Then by (6.46) we get 
<P > = de= cosa Ai-+ cos f Aj + cos yp Ak = 
= COS & SP +. cos B => se + COs Y= sb 


=cosa 2 + cos 6 2 P+ cos y 


Thus the iiative ca be most either from the formula 
SP. — cos a 4 P + -cos p SP P +cos y os , (6.52) 


* Since e is a unit vector, its coordinates are of the form {cos a, cos B, cos ¥}, 
where a, B and y are the angles e makes with the Oz, Oy, and Oz axes respectively. 
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or, considering that P, Q, # are the coordinates of p (.\/), from the 
formula 


op (oP _ OP | oP i 
E = | je COS cos f} + =— cos »} i+ 


+(e cos. + 3 cos B+ 22 cos v)i+ 


i 
OR OR , OR . 75 
+(3 cos & +5 /- cos B -+ “55 cos v) fe (0.99) 


6.2.4. Compositions of ficld theory operations. We shall] assume 
that a scalar field u (A/) of the class C** and a vector field p (AJ) of 
the class C* are given in a domain Q of a Euclidean space £°. 

Under these hypotheses grad uv is a differentiable vector ficld 
in Q, div pis a differentiable scalar field and cur] pis a differentiable 
vector field. Therefore the following compositions of operations are 
permissible: 

curl grad u, div grad u, grad div p, div cur] p, curl cur] p. 

We prove that 


curl gradu = 0, divcurl p = 0. (6.54) 
To this end we compute curl gradu and div curl p in a Carte- 
sian system. Since in this case the coordinates of gradu are SY 


fu ou , on the basis of formula (6.51) we have 


ou? Os 
O7u O34 . O7u O2u . 
curl grad u= (>> ~— | + (se -tex i+ 


Oru Ou 

+( ez dy dyaz k= 0, 

Thus the first of the equations (6.54) is true for a Cartesian 
system. By the invariance of the expression for curl gradu, the 
first of the equations (6.54) is proved. We proceed to prove the 
second of the equations (6.54). We again turn to a Cartesian 
system. In this system, according to (6.51), a vector ficld curl p 
has coordinates (=--<2). (<p <2), (-=) , whero 
P, Q, R are the coordinates of a vector p. According to (6.50) 
the divergence of a vector field curl p in Cartesian coordinates 
is equal to the sum of the derivatives of the components of curl 


* A function belongs to a class Ck in © if all of its partial derivatives of 
order kK are continuous. 
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p with respect to like coordinates. Thus 
div curl p= = (4-2) g & = )+ 


az \ oy dz }) ' @y \ és Oz 
@¢6Q o@P 
+3, (3-4, )=0. 


The second of the equations (6.54) is thus true for a Cartesian system. 
By virtue of the invariance of the expression for div cur] p, the second 
of the equations (6.54) is true in any coordinate system. 

One of the basic composed operations of field theory is the opera- 
tion div grad u. It is briefly designated Au, the symbol A being 
usually called the Laplacian* (operator). Thus 


Au = div grad u. (6.35) 


We evaluale the Laplacian in a Cartesian system. In such a 


Ou Ou Ou 


system a vector field grad u has coordinates roy OE" Using 


the expression (6.50) for the divergence of a vector field we get 


6*u °u O-u _ 
Au Setaeta- (6.58) 


The composed operations grad div p and cur] curl p are connected 
by the relation 


curl curl p = grad div p — Ap, (6.57) 


where Ap is a vector whose coordinates in the basis 7, jf, k are equal 
to AP, AQ, AR (P, @, A are the coordinates of a vector field p 
in i, J, &). The reader can easily show on his own the validity of 
relation (6.97). 


6.3. EXPRESSING BASIC FIELD THEORY OPERATIONS 
IN CURVILINEAR COORDINATES 


6.3.4. Curvilinear coordinates. Let 2 be a domain of a Euclidean 
space E°; jet z, uv, 2 be Cartesian coordinates in that space. Suppose 


further that © is a domain of a Euclidean space £? and that 2!, 22, <° 
are Cartesian coordinates in E£°%, 
Consider a one-to-one and bicontinuous mapping of 2 onto 2 
effected via the functions 
zo (zt, x*, 25), y = y (2!, 2, 2), 2 = 2 (cl, 2?, ), (6.58) 


Using the above mapping we introduce in @ curvilinear coordinates 
x’, x*, 2°. The meaning of the term is easy to see from the following 


* Pierre Simon Laplace (1749-1827) is an outstanding French astronomer, 
mathematician, and physicist. 
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arguments. First, associated with each point Af (zr, y, 2) of Q ar 
three numbers z!, z*, z°. More precisely, .W is determined by a triple 
of numbers z', 2°. 23. This explains the term “coordinates” of point 
for numbers z', 2°, 2°. Secondly, if on the right-hand sides of rela- 
tions (6.58) any two coordinates. for example 2° and 2°, are fixed, 
then fora variable x! these relations 
define in Q some curve different in 
general from a straight line. It is nat- 
ural to call that curve a coordinate 
curve x', thus emphasizing that at 
the points of tho curve only the co- 
ordinate zt changes. The coordinate 
curves 2° and 2° are defined quite 
similarly. In general the coordinate 
curves z!, 77, and z*are not straight 
lines. This explains the term “cur- 
vilinear coordinates”. 

We have shown that there are 
three coordinate curves z!, 27, 2 
Fig. 6.2 passing through each As of 2 
(Fig. 6.2). We construct ala point 
fia basis r,, r° associated in a natural way with the coor- 
dinate curves passing through that point. In doing so we use 
relations (6.58). Obviously the derivatives or OH ve 

Ox' dx}? ort? 
lated at a point AZ are the coordinates of the tangent vector to 
wc at that point. We denote this vector by 7,. In a similar way 
we construct the tangent vectors r. and 7, to the curves 2° and 
x3 respectively. Thus, 


i 


calcue 


Ox Oy Oz , _ 
Sars ar gerfr Bad, 2 3, (6.59) 


Yr) = 

For the vectors 73, 72, rs to form a basis we must require that 
they should be noncoplanar. A sufficient condition for this require- 
ment to be met is obviously the condition that the Jacobian 


Or Oy Os 

Ozt Oz! dz! 

“ (x, y,5) = | dx Oy Os 
(ri, z2, 23) 7 | dz? ax? az? 


Ox On Os 


a 


art ozs drt 


should be nonvanishing, for it is equal to the triple product of the 
vectors?,, Me, rz. Using the constructed basis r,, ra, ra the conjugate 
basis r', r?, o* is constructed in a standard wav. 
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So, if curvilinear coordinates z!, x?, x? are introduced in a do- 
main ©, basis vectors r;, r‘? are associated with each point M of Q 
in a natural way. Consider the examples. 

1°. Cylindrical coordinate system. It is introduced with the aid 
of the relations 


Z=pcosg, Y=psing, 2=4z. (6.60) 
Thus zi’ =o, 2? =@q, 2 =z. It is known that the coordinates 
P, g, 2 (or, equivalently, z', x*, z*) change within the following 
limits: 


O<cpct+o, OX gan, —w mS z< oe&. 


These inequalities define in a Euclidean space E? with coordinates 
0, @, 2, (or z?, z*, 2) an infinite domain Q represented in Fig. 6.3. 


Fig. 6.3 


We may therefore regard the introduction of cylindrical coordinates 


in EF? as a result of mapping a domain & of #? into Z*® with the aid 
of formulas (6.60). 

Obviously coordinate curves p (or curves x‘) are straight lines 
passing through the Oz axis perpendicularly to it, coordinate curves 
@ (curves 2’) are circles with centres on the Oz axis, whose planes are 
parallel to the Oxy plane. Coordinate curves z (curves z°) are straight 
lines parallel to the Oz axis (see Fig. 6.3). We find the vectors r,, 


11-—-01684 
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r,,r, and ri, r°, rr’. We have 


Ox oy UZ ° 
{ape pe Gp} a (eos, sin 9, 0} 


; } oP . 
ry fs ey Sz} ={-esin Pp, O COS q, 0}, 


vp? dm ’ 
Ox dy Oz 
r={Z, st, F}=10, 0, 1). 


We stress that the expressions in the braces are the Cartesian 
coordinates of the basis vectors ry, ?2, and rz. One can directly see 
that the basis 7, 7., rg is orthogonal. To calculate the conjugate 
basis we use the formulas of Section 6.41.1. We have 

cos sing OQ 
rears =|—osing pcos O}=p, 
0 0 { 


(roraf-= {pcasgp, psing, GH, 
[rary] ={— sing, cos, O}, 
[7°47] = {0, 0, p}. 

Therefore 


rots} ; 
ri ee ={cosp, sing, O}, 


a [rary] ={-— . A \ 
ial saan 5 SiN Py — COS Py 0}, 


3 yrs) , 

a ={0, 0, 1}. 

2°, Spherical coordinate system. It is introduced with the aid of 
the relations 


x=psinOcosq, y =: sin 0 sin y, 2 = pcos 0. (6.64) 
Thus sz? =p, a =o, 2 = 0. It is known that the coordinates 
p, », O (or, equivalently, 27, 2*, z°) vary within the following 
limits: 


CGcgpc+o, Oc gpc2n,0cd0ca. (6.62) 


Inequalities (6.62) define in i? with coordinates p, g, 0 (or 2’, 
x*, x’) an infinite domain © represented in Fig, 6.4. We may there- 
fore regard the introduction of spherical coordinates in £7 as a result 
of mapping the domain @ of £3 into £7? by means of formulas (6.64). 

Obviously, coordinate curves p (curves zt!) are rays emanating 
from the origin, coordinate curves (curves 7°) are circles with 
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centres on the Oz axis, whose planes are paralle] to the Ozy plane, ~ 
coordinate curves 0 (curves 2°) are semicircles whose centres are at 
the origin and whose planes pass through the Oz axis (see Fig. 6.4). 


We find the vectors r,, 72, rz, and r', r*, r°, We have 


r,={sin 8 cosq, sin®, sing, cos9}, 
={—psin®@ sing, psin8cosg, 0}, 
r,={ocos 8 cos g, pcos8 sing, —psin 0}, 


One can directly see that the basis r,, re, rz is orthogonal. To calcu- 


late the conjugate basis we use the formulas of Section 6.1.14. We 
have 


sinOcos@ sin@ sin cos 9 
r,Pofs =| —psin@sin @ psin 8 cos g QO |= —p*sin9, 
pcos8cos@ pcos Osin ge —psin§ 
[7.7] ={—p’* sin’ 0 cos p —p*sin?6 sin, —p*sin 9 cos 0}, 
irr]={ sing, POs ps 0 }s 
[ryr2] ={—pcosOsin8cosp, —pcosOsinGsin ¢, psin?9 }. 
Therefore 
{__ [7 o%e] __ ° * 6G ° g 
ri——25- = {sin 6cos 9, sin Osin ~, cos 8}, 
rill 
ao [rg] _. __ i sing = cos p 0}, 
po Taf p sin® ’ “sino ? 


r3 == [rr] 


4 1 . 4, 
rary is cos 0cos 9, " cos8sing, — “p sin a} : 


3°. Orthogonal curvilinear coordinate system, A curvilinear coordi- 
Nate system is said to be orthogonal if so is the basis r; defined by 


11* 
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equation (6.59) at each point of Q. The cylindrical and spherical 
coordinate systems just considered are examples of orthogonal curvi- 
linear coordinates. 

We obtain the expression for vectors ri of the conjugate basis for 
the case of the orthogonal coordinate system. 

We introduce the following notation: 


Ht, = |r, ], 4, = |re |, Hs = | rs |. 
It is usual to call ,, H., Hs, Lamé* coefficients or parameters. 


Since the coordinate system is orthogonal and the triple of vectors 
ry, Ma, rs is right-handed, we have 


ryPols=s Ay Hills, [rears] = EPs ry [sry] = 
1 
Hall, Hye | 


Using these relations and the formulas expressing the vectors of the 
conjugate basis in terms of vectors r; (see Section 6.1.1) we get 


nt top etl pp tl, 
AB — HE? ~ FRO 
6.3.2. Expressing the gradient and directional derivative for a Scalar 
field in curvilinear coordinates. Let u (1/) be a differentiable scalar 
field in a domain Q in which curvilinear coordinates z', z*, 7 
are introduced. Under these conditions grad uw is defined at cach 


point of 2 and at each point 2 one can calculate for any direction 
. . Ou . . . 

ea derivative ——. Both the gradient grad u and the directional 
derivative at a given point Jf will be referred to the basis r,, r! 
at that point, the construction of which has been described in Sec- 
fion 6.3.1. 

1°. Expressing the gradient of a scalar field in curvilinear coordi- 
nates. After introducing in Q curvilinear coordinates z!, z*, x° the 


scalar field uw will clearly be a function of the variables z', 2°, 27: 
usu (xt, 2°, 27). 

This function may be regarded as a result of superposing the 

function u(z, y, cs) of variables z, y, 2 and the functions (6.58). 


To calculate the derivatives therefore we may apply the 
OF 


eo id e e e e Ou 
indirect differentiation rule. Denoting — by u, we get 
SZ 


Ou OX ou oy Ou az 
f Ox axl cy ori ' Oz zl ( ) 


* Gabriel Lamé (1795-1870) is a French mathematician. 
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. 0d du @ ; 
Since oa By? — are the coordinates of the vector grad wu in 
e ° e o 
the basis i, j, & referred to the Oxyz system, and — ; — ; 
x z 


< are the coordinates of a vector r;, relation (6.63) may 
oz 


obviously be rewritten in the following form: 
uU;, =r; erad u, (6.64) 


Using the Gibbs formulas (see formulas (6.6) of this chapter) for 
grad u and formulas (6.64) we get 


grad u=(r; gradu) r?'=u,r*. 


So the gradient of a scalar field uw is expressed in curvilinear coordi- 
nates as 

grad i= ur (up =) (6.65) 
In practice we often encounter the case of the orthogonal curvilinear 
system. In Section 6.3.41 we obtained (see Section 6.3.1.3°) an expres- 
sion for the vectors ri of the conjugate basis for the orthogonal 
system. Using these expressions and formula (6.65) we find the follow- 
ing formula for grad u in orthogonal coordinates: 


4 du 4 @u 4 @u 
grad U= Fe ar at ay Gar "et AE Gas Ms (6.66) 


An orthonormal basis e; = r;/H; is considered along with the ortho- 
gonal basis r;. It is easy to see that in e; the expression for grad u 
is of the form 


4 du 4 du 4 @u 
grad us oer at oat VE Gat &- (6.67) 
2°, Expressing the derivative of a scalar field wu (A1) with respect 
to a direction e in curvilinear coordinates. Let e! be the contravariant 
coordinates of a unit vector e in a basis r;, so that 


e=e'r,. 


In Section 6.2.2 we obtained the following formula for the 
derivative s: 


du 
35 = e-grad u 
(see formula (6.40)). Substituting in this formula the expression for 
ein the basis r; and formula (6.65) for grad u we get 
du 


; . i 
ao (e"ry) (uar') = eu, (110) = efuySi = Wet. 
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Thus the derivative of a scalar field u with respect to e can te 
expressed in curvilinear coordinates as follows: 
ener 2k TN 
—— = uc’. (O.65. 
ae i 
6.3.3. Expressing divergence. curl and directional derivative So: 
a vector field in curvilinear coordinates. Let p (\/) be a difierentiah!s 
vector field in a domain 2 in which curvilinear coordinates are intra- 
duced. Under these conditions the divergence and curl of the ficid p 
are defined at each point of O and at each point of Q one can calcu- 


. . . . 7) nh . 
Jate for any direction e a derivative < . The divergence, curl and 


directional derivative at a given point Jf will be referred to the basts 
r;,r’ at that point. 

1°. Expressing the divergence of a vector field in curvilinear coordi- 
nates. After cuvilinear coordinates z!, x*, + are introduced in 2 
the vector field p will obviously be a function of the variables 
a aa 


p=p(z', x, x). 


This function may be regarded as a result of superposing the 
function p(z, y, =) and functions (6.58). To calculate the deriva- 


. a . . . - 2 
tives ~ therefore, we may apply the indirect differentiation 
UX 
a YP he 
rule. Denoting — by p,; we get 
OF 
op os ’ op OY t op dz . 66 
x ff te OL ADs 
Pr 6x ort ' oy dzt ' oz dxf (8.55) 
e ef a a e o o ° 
Since f= Ai, f= Aj, af = Ak, where st is a linear operator 


defined by the equation Ap=AAr+o(jArf{) (see Section 6.2.3). 
from relations (6.69) and the properties of a linear operator we 


get 


OF ., OV «. Of 4 7 
p= A(sit i + 5k} =Ar). (6.70) 


By definition div p = div A = r'tAr,, According to formula (6.7) 
therefore, in curvilinear coordinates the divergence of the vector 
field p CAL) can be calculated from the formula 


. . 7) ft 
div p=r'p; (p.=-5). (.¢4) 
We find the expression for divergence for the case of the orthozonal 


curvilinear coordinate system. Using the expression for the vectors r' 
of the conjugate basis for orthogona) curvilinear coordinates and 
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formula (6. he we Bet 


0 
divp=—> at Pry ta ne Pret oe F Psls ( pi = 7 ). (6.72) 
There is another way of writing formula (6.72). Denote by P* the 


* 
coordinates of the field p in the orthonormal basis C= 
Then, after a number of transformations, the expression (6.72) 
for div p takes the following form: 

. _ 1 0 (P!H.H3) 6 (P*H4H,) 0.(P3H,H >) 
div p= ag |Get et a |. 
(6.73) 


2°. Expressing the curl of a vector field in curvilinear coordinates. 
By definition curl p = curl A = [riAr;]. According to formula 
(6.70) therefore we get 


curl p= [r'p,] (pi=35). (6.74) 


We seek the expression for the curl in the orthogonal curvilinear 
coordinate system. Using the expression for the vectors 7? of the 
conjugate basis for the orthogonal system and formula (6.74) we get. . 


0 
curl p= ay pd + ae ('2Pal-+ HE [rsps] (pi=s5). 


(6.75) 

In the orthonormal basis e;= A the curl of p has the coordinates 
{a “ee |, Ta [eae —“ |, 

wie | ar Ge I} (6.76) 


3o.. Expressing the directional derivative of a vector field in curvi- 
linear coordinates. We use the formula 


Op 
<i = Ae (6.46) 


obtained in Section 6.2.3. Let e = e’r;. Then from formula ©. 46) 
and the properties of a linear operator we get 


Since Ar; = p;, where pa, for the derivative of the vector 
x 0 


field p with respect to a direction e we obtain the following 


* On the right-hand side of this formula we do not sum over i. 
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expression: 
—? =e! py. (6.77) 


6.3.4. Expressing the Laplacian operator in curvilinear orthogonal 
coordinates. We have defined the Laplacian Au as a repeated opera- 
tion div grad u. Using the expressions (6.67) and (6.73) for the gra- 
dient and divergence in curvilinear orthogonal coordinates we obtain 
the expression for the Laplacian. 

In the case under consideration a vector field p whose divergence 
is to be calculated is the field grad u. Substituting (6.67) into (6.73) 
we get 


Au = [ é ( Hat au Ja. 
A HoH; oz! ff, oz* ‘ 


, O [ HsH, Ouy, @ H,H, ov _ 
a: = ( H, ox? )+ sas ( H; sa) |. (5.78) 


6.3.5. Expressing the basic field theory operations in cylindrical 
and spherical coordinate systems. 

1°. Cylindrical coordinate system. By virtue of the results of Sec- 
tion 6.3.1.4° Lamé parameters for cylindrical coordinates are of 
the form 


H, = 1, HH. =o, Hy, = 1. 


In such a case formulas (6.67), (6.73), (6.76), and (6.78) yield the 
following equations: 


ou 41 Gu fu 
grad u== ep , 7 ae &F ae Cx, 


4 oP, Ps OPp OP, 
curl p= (Fe —FE7 Jet (> —_ =) on 
+(4202 ae <2) ¢ 
"\p op p ag fj * 
au= bi (pH). 4 Sey Be, 
Pp ap \" op / * p? oq § az? 


2°. Spherical coordinate system. In this case Lamé parameters 
are of the form 


Hf, =1, H.ez=psin@, M,= 0p. 
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Therefore 
. Ou 
grad u = Cot Sand San “ ep +> p e © 
. _— OP o 4 
div p = Gp (P° P+ Sane aa ap. Go + Paine psin@ “gq (sin 8Pe), 
1 O(sinOPg) gp, 
curl p= p sin @ ( 662=—iCD )ep+ 
4 9Pp 4 4(pPg) 4 a(pPe) 1 9p 
Se ee ee oe 
pam ap p ap p dp p 00 
_ a 2 Ou 4 a . [du } 1 d°u 
Au = ‘p? =o (e ap se )+ po? sin 6 <5 (sin a6 p? sin?6 dg? ° 


In conclusion we give a summary of formulas relating the operations: 
of taking the gradient, divergence and curl to algebraic operations: 


1°. grad (u + v) = grad u + grad v. 

2°. grad (u-v) =ugrad p-+v grad wu. 

3°. grad (+) = 2eradusugrde (v0). 
4°. div(p+q)=div p+ divq. 

D°. div (up) = pgradu+u div p. 

6°. div [pg)=gq curl p— pceurl gq. 

7. curl[p+qj=curlp+curlg. 


8°. curl (up) = ucurl p—[pegrad ul}. 

It is easy for the reader to show the validity of these formulas- 
on his own. 

Concluding remarks. In this chapter we have discussed the basic 
operations of field theory. We have not relied on any physical ideas. 
since our aim was to construct a mathematical theory. In the next 
chapter we shall derive a number of important integral relations 
connecting some of the operations of field theory. These relations wil} 
allow us to give physical interpretation of the concepts and opera— 
tions we have introduced in the present chapter. 


CHAPTER 7 


THE FORMULAS OF GREEN, 
STOKES, AND OSTROGRADSKY 


In this chapter we obtain formulas playing an important role 
i various applications, and in particular in field theory. They 
ure in a sense extensions to the multidimensional case of the 
Newton-Liebniz formula for one-dimensional] integrals. 


7.1. THE GREEN * FORMULA 


7.1.1. Statement of the main theorem. Let D be a finite, multiply 
connected in general, domain in the Ory plane with piecewise smooth 
boundary Z2**, The domain D with the houndary ZL adjoined will 


he designated D. The following main theorem is true. 


Theorem 7.1. Let functions P (z, y) and Q(z, y) be continuous 
in D and have continuous partial derivatives of the ftrst order in D. 
If there are improper integrals of each of the partial derivatives of the 
functions P (xz, y) and Q (x, y) over the domain D***, we have 


\\ (S$-45) az dy = § Pdz+Q dy, (7.4) 
D L 


a relation called the Green formula. The integral on the right of (7.4) 
is the sum of the integrals along the components of L on which a sense 
of rotation is indicaled such that the domain D remains on the left. 

We shall first prove the Green formula for a special but sufficiently 
wide class of domains. We shall then establish a number of auxiliary 
statements which we shall need to prove the theorem we have stated. 

7.1.2. The proof of the Green formula for a special class of domains. 
Let D be a singly connected finite domain with piecewise smoath 


* George Green (1793-1644) 15 an English mathematician. 

** J, is said to be piccewise smooth if it is made up of a finite number of 
smooth curves. If £ consists of a finite number of closed piccewise smooth curves 
L,, then the connected domain D is usually said to be multiply connected, and 
E; are said to be connected components of the boundary L. 

“*4* Since the partial derivatives of P (z, y) and Q (z, ) exist only in an open 
domain D, the integrals are improper. Under the additional assumption Chat the 


partial derivatives in D are continuous the integrals become proper integrals. 
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boundary LZ. We assume that every straight line paralle!] to any coor- 
dinate axis intersects L in at most two points. Such domains will 
be called type-K domains. 

Under the hypothesis there are improper integrals of partial deri- 
vatives of P (z, y) and Q(z, y). This means that for any system 


Fig. 7.4 


of domains {D,} monotonically exhausting D we have for example 


lim [ | 22 de dy = \ \ £2 ax dy 
me “Da D 


(similar relations are true also for other partial derivatives of P (x, y) 
and Q (2, y)). 

We describe how to construct a special system of domains {D,} 
monotonically exhausting a type-AK domain. We shall need it in 
proving the Green formula for those domains. 


Let a closed interval {a, b] on the Oz axis be the projection of D 
onto that axis. Draw through the points a and 8b straight lines parallel 
to the Oy axis. Hither of the two lines intersects the boundary Z 
in one point only. The two points A and # of intersection of the 
lines with Z (Fig. 7.4) divide Z into two curves L’ and L” which 
are obviously the graphs of the continuous functions y, (x) and 
Yo (x) piecewise differentiable on (a, 6], respectively. Note (Fig. 7.4) 
that y, (ct) < ye (x) (equality holding only for z = a and x = b). 

Next consider a sequence of closed intervals [a,, b,}] such that 
a<da,<b, <b, a, >a, 6, > b as n -> 0. In addition suppose 
that for any n an interval [a,, b,] is contained in [a,4,, b,44). 
Choose a number e, > 0 so that the graphs 2, and Li, of the func- 
tions y, (x) + ©, and y, (rz) —e, are in D and do not intersect. 
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The boundary of D, is a curve made up of the curves LZ; and L, 
and the segments of the vertical straight lines passing through the 


points a, and b, (Fig. 7.1). The domain D,.4, is constructed in 
a similar way, but instead of the interval [a,, b,,) we take an interval 
la,=;, b,4,] and choose a numbere,.+, > O less than the number e.. 
Jt is obvions that if ¢, +0, then the constructed system of domains 
{D,} monotonically exhausts the domain D. . 

We prove the following statement. 

Theorem 7.2. Let functions P (z, y) and Q (x, y) satisfy in a lype-k 
domain D the hypotheses of Theorem 7.1]. Then for that domain and 
for the functions P (x, y) and Q (zx, y) the Green formula is valid. 

Proof. It suffices to show the validity of the equations 


\ oe Gedy = § Qa — i “Gy 92 dy =O Pas. (7. 


Since the proofs for these are of the same type, we shall prove only 
the second of the equations. 
Consider the double integral 


{ { as dz dy. (7.3) 


Dn 
For J, and the integrand Fein (7.3) all the conditions hold under 


which the repeated integration formula is valid. From this formula 
we have 


2 OP bn V3(x)-en 9 P 
¢ 
| | Gp dzdy= | dz \ a7 dU = 
Dn Gn = Wa(x)+En 
on bn, 
= [ P (x, Yo(x)— &,) dz — { P (x, 5 (xz) + €,) az. (7.4) 
en dn 


The left-hand side of relations (7.4) as n-> oo has a limit equal 


to | = dxdy. By uniform continuity of P(x, y) in D, each of 


D 

the terms on the right of (7.4) has as nr oo a limit equal to 
l t 

P (xz, y2(x)) dx for the first term and to P (xz, 4y(z)) dz for the 


{1 ed 
second. The former integral, with the direction of circulation 
about the boundary as indicated in Fig. 7.4, is the line integral 


— [ P(z, uydz 
i. 
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and the latter is the line integral 
| Piz, y) dz, 
L’ 


We see that the right-hand side of relations (7.4) as n oo has 
a limit equal to 


—| Piz. y) dr. 
L 


Thus the second of the formulas (7.2) is proved. The validity of the 
first of the formulas (7.2) is estab] 


ished similarly (one must project 
D onto the Oy axis and repeat the above arguments). The proof 
of the theorem is complete. 

7.1.3. The invariant form of the Green formula. Let functions 
P (z, y) and Q (z, y) Satisfy the hypotheses of Theorem 7.1 in 
a finite connected domain D with piecewise smooth boundary L,. 
We define in D = p + La vector field P Whose coordinates in a given 

rtesian coordinate System equal P (z, y) and Q(z, y). Under 
the conditions imposed on the functions P (x, y) and Q? (z, y) the 
field p will clearly be contin D 


uous in D and continuously different- 
lable in D. We seek the curl of p. Using tl 


the orthonormal basis i,j, k 
0 oP 
cur} p=(2- ) k 
Hence 
a 
2 - > = curl p. 


(7.5) 


orthonorma!] 
ated to that 
in the new hasis. In the 
and Q’ Clearly satj 


Remark 1. We change in the Or 
basis i; 


Sis, 


y plane to a new 
new Cartesian System Oz’y’ re] 
Let p have coordinates P’ and Q’ 


the functions P’ 


isfy 
rem 7.14. Moreover, since in the new basjs 
curl p= (22 
Pp (<8 — 37), we have 

90" ap 

Oa Fos Ie curl p. (7.6) 
Si 
— the scalar Product & curl p is an invariant, it follows 
~ (7,5) and (7.6) that the expression ead Changes noj 
Xs form ‘or value unde l ae “ae 
= : singel hl rage 4 change to new orthonormal] busis, 

Ma 

"mark: n, : 


Taw the following important conclusi 1 
tegral on the left of the Green formula (7:4) has = cee 


174 Fundamentals of Jiathematieal Analysts 


ant nature, its form and value remain unallered under a transition to 
a new Cariesian system. Indeed, under such a transformation of 
coordinates the absolute value of the Jacobian transformation 
is equal to unity. According to the remark, however, the integrand 
changes neither form nor value. 

Now consider the integral 


b Pdz+Qdy (7.7) 
L 


on the right of the Green formula. We show that this integral has 
also an invariant nature, its form and valueremaining unaltered under 
a transition fo a@ new Cartesian system. 

Let ¢ be the unit tangent vector at the points of Z whose direc- 
{ion agrees with the direction of circulation about L, and let cose 
and sin a be the coordinates of ¢. Choose as a parameter on & an 
arc Jength 7, the increase in / agreeing on each connected component 
of the boundary with the direction of circulation about that compo- 
nent, Under the conditions on ZL the function 7@ (7) will be piecewise 
continuous. Under the conditions formulated above the vector field p 
is continuous on ZL and its coordinates P and Q are continuous func- 
tions of 1. 

Notice that once the sense of rotation and the parameter of L 
are chosen the line integral of the second kind (7.7) is transformed 
into a line integral of the first kind. P and @ are calculated at the 
points of LZ and dz = cosadl, dy = sing dl. Thus 


sy Pdz+Qdy = (Pcosa-+-Q sina) dl= 4 pt dl. (7.8) 
L L I, 


Relation (7.8) shows that the integral (7.7) has indeed an invariant 
nature: the scalar product pf is an invariant, and parametrization 
with the aid of arc Jength is independent of the coordinate system. 
Moreover, in the new Cartesian system Oz'y’ 


pt dl = (P’ cos a’ + Q’ sin «') dl = P! dz’ + Q" dy’, 
and therefore 

Pdzx+Qdy = P' dr’ +@’ dy’. 
So we have shown that the integral (7.7) has an invariant nature, 
its form and value remain unaltered under a transition to a new 
Carfesian system. . 


The ahove reasoning allows the Green formula (7.1) to assume 
the following invariant form 


{ { curl pdo = § ptdl, (7.9) 
oo L 


where do is an area element of D. 
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Remark 2. The integral 


) pe dl 
L 


is usually called the circulation of a vector field p about a curve L. 

From Theorem 7.2 and the conclusions of this subsection we may 
draw an important corollary. 

Corollary. Let functions P (x, y) and 
O (x, y) satisfy the hypotheses of Theorem 7.1 
in a finite domain D with piecewise smooth 
boundary L. If D can be divided into a finite 
number of domains D, with piecewise smooth 
boundaries L,, (Fig. 7.2) each Dp, being a 
type-K domain with respect to some Cartesian 
system, then for D and for P(x, y) and 
Q (xz, y) the Green formula is valid. 

The validity of the corollary follows from 
the following arguments. It is clear that the Fig. 7.2 
Green formula is valid for each of the do- 
mains D,. This follows from the invariance of the formula and from: 
Theorem 7.2 (in some coordinate system D,; is a type-& domain). 


It is also obvious that the sum of the integrals | | (<= _~ 


Dy, 
—>) dxdy on the left-hand sides of the Green formulas over 
. ; a0 oP , 
the domains D, is \ (3¢-3,} dxdy. But the sum of the line 


integrals b Pdx+Qdy on the right-hand sides of the Green 
Ly 
formulas along the boundaries L, of D, gives & P dx+Q dy, for 


L 
the integrals along the common parts of the boundary of the 
domains D, cancel out, these parts being traced in opposite 
directions in adjacent domains J, (consider Fig. 7.2 for expla- 
nation). 

Remark 3. An arbitrary finite connected domain D with piecewise: 
smooth boundary Z cannot in general be divided into a finite number 
of domains D, of the type indicated above. From every finite domain 
D with piecewise smooth boundary, however, we can remove an 
arbitrarily small part such that the remaining domain can be divided 
in the necessary way. Contributions to the right- and left-hand sides. 
of the Green formula corresponding to the removed part of D will 
accordingly be arbitrarily small. This idea is the basis of the proof 
for the Green formula in the general case. 


tw 
-% 
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[In Section 7.1.4 we shall prove a number of auxiliary proposi- 
tions using which we shall establish the Green formula in the general 
<ase in the way indicated above. 

7.1.4. Auxiliary propositions. Let L be a piecewise smooth plane 
curve without self-intersections on which the arc length ? is chosen 
as the parameter. 

A neighbourhood of an interior point P on L is any connected open 
set of points of & not coinciding with the whole of the curve Z and 

_ p containing the point P. For an end point of L 

we introduce the concepl of half-neighbourhood.* 
{ The length of a neighbourhood (or half-neigh- 
bourhood) is called its extent. 

An interior point P of ZL divides each of its 
neighbourhoods into two half-neighbourhoods. 
A neighbourhood of P is said to be a A-noigh- 
bourhood if cither of the half-neighbourhoads 
is of Jength 2. 

Lemma 1. Let L be a smooth finite curve 
Fig. 7.3 without self-intersections, let A and B be the end 


points of the curve, and let L be a connected part 


of L consisting together with its end poinis A and B entirely of interior 
points of L (Fig. 7.3)**. We can find two positive numbers 7 and § 
such that the supremum of the angles which the tangents at the points 


of the 3-neighbourhood of any point P of L*** makewith the tangent 
at P is less than a!8 and the distances from P to the points of L outside 
the 4-neighbourhood are less than &****, 

Proof. We show that we can find 2 > 0 satisfying the hypotheses 
of the demma. First, we note that given any «@ > 0 we can find for 
every point Pa A-neighbourhood (4 > 0) such that within it the 
supremum of the angles the tangents at the points of that A-neigh- 
bourhood make with the tangent at the point P is less than a. 
‘This follows from the continuity of tangents to the curve JL, 

The question is whether there is a universal 2% suitable for all 


points of the curve Z. 
Assume that there is no %7 > 0 satisfying the hypotheses of the 


lomma. Then given any A, = 1/n on ZL we can find points P, and 


2 


* If Pisan end point of L and @ is any of its other points, then the set of all 
points of L between P and Q including P but not Q@ is said to be a half-netyh- 
hourhoad of P. _ 

** The curve may be closed. In this case 4 may coincide with 4. 1f Lisa 


ecloced curve with one corner point, then ZL is any closed connected part of f 
net containing that corner point. _ 
ee* 4 neichbourhood of a point of Z is considered to be a neighbourhood 


of that point on L. 
-eeee Obviously 4% > 5. 
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Q, such that the length of the arc P,Q, is less than 4, and the angle be- 
tween the tangents at those points is at least equal toa fixed a < x/8. 


We select a subsequence {P,,} of {P,} converging to a point P of L. 
Obviously {Q,,} also converges to P. Consider the A-neighbourhood 


of P in which the supremum of the angles between the tangonts at 
the points of the neighbourhood and at P is less than a@/2. 

It is clear that the angle between the tangents at any two points 
of that A-neighbourhood of P is less than e With n,, sufficiently 
large, the points P,, and Qn, will be in the chosen A-neighbourhood 


of P and therefore the angle between the tangents at those points 
must be less than a@ whereas by the choice of the points it must be 
greater or equal to a. This contradicts our assumption that thero 
is no A > 0 satisfying the hypotheses of the lemma. Note that the 
required A is less than cither of the arcs AA or BB. 

Now we prove that we can find 6>0 satisfying the hypotheses of 
the lemma. 

Assume that there is no 6 > 0 satisfying the hypotheses of the 
lemma. Then given any 6, = 1/n we can find a point P, on Z and 
a point Q, on ZL such that the longth of tho arc P,Q, is equal to or 
greater than 7.* whereas the chord P,Q, is of length smaller than 6,. 
We select a subsequence of {P,} converging to a point P of L and 
consider the corresponding subsequence of {2,}. We select a subse- 
quence {Q,,} of that Jast subsequence converging to the point Q of L. 
It is clear that {P,,} converges to P. Since by the choice of the points 
P,, and Qn, the length of the arc P,,Qn, is equal to or greater than 4, 
so is the length of the arc PQ. Since the lengths of the chords P,,Qn, 
tend to zero, the length of the chord PQ is zero, i.c. the point P 
coincides with Q and is therefore a point of self-intersection of the 
curve L without self-intersections. This contradiction supports the 
possibility of choosing the required 5 > 0. The proof of the Iemma 
is complete. 

Corollary 1. Let the curves LZ and ZL satisfy the hypotheses of 
Lemma 41. Then we can find a number 2A such that any are of ZL 
of length smaller than 2A is 1-1 projected onto one of the coordinate 
axes of a fixed Cartesian system Oxy. 

Indeed, take as A the number indicated in Lemma 1. Any arc 
of L of length smaller than 2 A is contained in a A-neighbourhood 
of some point P on L. The tangent at P makes with the Oz or Oy 
axis an angle equal to or less than 2/4. Then clearly the tangent at 
any point of the arc under consideration makes with that axis an 
angle less than n/2 and therefore the arc is 1-1 projected onto the 


* That such a A exists has been established in the first part of the proof. 
{2—01684 
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axis (were it not 4-4 projected there would be tangents making 
with the axis an angle equal to 2/2). 


“Corollary 2. Let the curves L and L satisfy the h ypotheses of Lemma 4. 


Then we can find a number 2?.>>0 such that any are of L of length 
smaller than 23. is 1-1 projected onto both coordinate axes of a Cartesian 
system Oxy especially chosen for that are. 


Take as %2 the number indicated in the lemma. Any arc of L less 


than 2 7 is contained ina %-neighbourhood of some point P of L. 
Choose a Cartesian system so that the tangent at P makes with 
the coordinate axes an angle a/4. Then the tangent at. any point 
of the arc will make with either of the Oz and Oy axes an angle less 
than a/2 and therefore the arc will be 4-4 projected onto cither of 
the axes. Note that slight changes in the coordinate system chosen 
do not affect the possibility of the arc being 4-1 projected onto 
both coordinate axes. 

Lemma 2. Let @ be a square, and let R be an angle with vertez 
at the centre P of the square Q and with opening 2a <. n/4. Denote 
by I the part of the boundary of Q contained in R. Then the angie be- 
tween any chord of the curve Y (the straight line joining tuo poinis of T) 
and the bisectriz of R is at least a. 

In view of the elementary nature of the Jemma we leave it unproved. 

Lemma 3. Let Q be a square and let L bea smooth curve without self- 
intersections emanating from the centre P of Q. Also suppose that the 
supremum of the angles the tangents to L make with the half-tangent 


to L at P is equal toa <x/8. Then L intersects the boundary of Q 
in at most one point. 


Proof. Construct an angle Ff of opening 2¢, 2a < 2a << n/4 whose 
bisectrix is the half-tangent to Z at P and whose vertex is the centro 
P of the square. Denote by fF the part of the boundary of Q contained 
in R. Obviously LZ is inside # (if L did not intersect a side of R 
at a point other than P, then there would be a tangent paralle} te 
that side and the tangent would make with the half-tangent to 


L at P an angle equal to a > a, which contradicts the hypothesis). 
Let D intersect I in two points Af and N. Then on L we could find 
n point the tangent at which would be parallel to the chord J/N 
and according to Lemma 3 that tangent would make with the half- 
tangent to L at P an angle at least equal io a@ > a@ and this contra- 
dicts the hypothesis. The proof of the Jemma is complecic. 
Corollary of Lemmas land 3. Let Land L satisfu the hypotheses 
of Lemma 1 and let 6 > 0 be the number indicated in that lemma. 
Then L intersects the boundary of any square Q with cenire at an arb- 
itrary point P and with side of less that V 26 in at most tivo points, 
We shal] demonstrate the validity of the corollary. Let P be an 
arbitrary point of Z and let ~>0 he the number indicated in 
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Lemma 1. Consider a A-neighbourhood of P. Both end points of the 


neighbourhood and the part of Z outside the ?-neighbourhood are 
according to Lemma 1 outside any square with centre at P and of 


side less than V 26. Therefore the A-neighbourhood under considera- 
tion (and only that neighbourhood) intersects the boundary of Q*. 
Since either of the half-neighbourhoods of the A-neighbourhood of 
P under consideration satisfies the 
hypotheses of Lemma 3, it is clear 
that the A-neighbourhood intersects 
the boundary of Q in at most two 
points. 

7.1.5. Special subdivision of a 
domain D with piecewise smooth 
boundary ZL. Let D be a multiply 
connected domain whose boundary 
ZL consists of a finite number of 
closed piecewise smooth curves, 
P,, Po, ..-, Py being the corner 
points of L. We assume a Cartesian 
system Oxy to be chosen in the 
plane. 

We shall show a method of spe- 
cial subdivision of D into subdom- 
ains. We shall need such subdivisions in proving Theorem 7.1. 

1°, Weshow that for any « > Owecan choose the squares Q,, Qe, ... 
..., Oy with centres at the corner points of £ and with sides parallel 
to the Ox and Oy axes (Fig. 7.4) so that the following conditions hold: 


(1) The boundary of any square Q; with centre at P; intersects 
either of the two branches of Z emanating from P}* in exactly one 
point (see Fig. 7.4). Those points are the only points the boundary 
of a square Q; has in common with that of L. 

(2) The sum of the areas of Q; is less than e; the sum of the lengths 


of the parts of Z that are in the squares Q; is also less than ¢. The 


sum of the perimeters of the squares Q; clearly is not greater than 
Ae, where A is some constant. 


The possibility of the above choice of the squares Q; arises from 
the following arguments. 

Consider the A-neighbourhoods of corner points subject to the 
requirements: 

4. These A-neighbourhoods do not intersect. 


Fig. 7.4 


* Here we are using the Jordan theorem which states that if two points of 
a continuous curve Z are an interior and an exterior point of a domain D then L 
intersects the boundary of D. 

** A sufficiently small A-neighbourhood of a corner point P; consists of two 
smooth branches emanating from that point. 


12% 
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2. The sum of tho lengths of all #-neighbourhoods is less than ec. 
3. The supremum of the angles the tangents of cither of the half- 
neighbourhoods of a 4-neighbourhood made with the corresponding 


half-tangent at a corner point is less than @ < 2/8. The possibility 
of the choice of such A-neighbourhoods of corner points is obvious. 
Note that either of the half-neighbourhoods of the 2-neighbourhoods 
choson Satisfies the hypotheses of Lemma 3. Therefore eithor of these 
half-neighbourhoods intersects in at most one point the boundary 
of any square with centre at the corresponding corner point. 

For every corner point ?, we define a number 6, > 0 equal to 
the infimum of the distances from P; to the part of Z obtained by 
removing from £ a neighbourhood of the point P;. 

We denote 5 = min {5,, 5,, ..., Sy}. It is clear that any square 
Q, with centre“at P; the length of whose side is less than 1/2 § satis- 


fies tho above condition (1) for, with the choice of square Q, indicated, 
the hypotheses of Lemma 4 hold for either of the half-neighbourhoods 
of the point P; and, in addition, the end points of the half-neigh- 
bourhood are outside the square Q, (this ensures the uniqueness of 
the pointofintersection of the half-neighbourhood with the boundary 
of the square). It is also clear that by making smaller tho sides of 
the squares we can make the sum of their areas becomo less than e. 
Obviously, the sum of the lengths of the parts of Z that are in the 


squares Q, will be less than e owing to a Special choice of #-neigh- 
bourhoods of the corner points. Thus condition (2) also holds when 
Q, are chosen as indicated. 


2°. We remove from £ those parts which are in the squares Q;. 
The remaining part of Z is a collection of smooth curves L, without 


common points, some of Z, boing smooth closed curves. Note that 
every open curve Z,; consists of interior points of a smooth curve L,; 
whose end points are the corner points of Z (see Fig. 7.4). 


For cach of the curves LZ; wo uso Lemma 1 of the preceding sub- 


section. Let 2; and Sf be numbers guaranteed for ZL; by that lemma. 
We make 6f obey yet another requirement, that 57 should be less 
than the infimum of the distances from the points of Z, to the other 
curves Z;,. Further denote 4°= min {%,, Ao, ..-) Ay} and O< 
< 6* < min {6f, 53, ..., Sk}, 6* < ]/2 4, where & is the num- 
ber chosen in 1°. Obviously 2% > 86”. 

We divide each curve Z; into a finite number of parts of length 
Jess than 5*. We construct squares Q; whose centres are at division 
points of the curve Z,, with sides of length 6* parallel to the Oz 
and Oy axes. 


3°. Using squares Q; and Q; we construct the required subdivision 
of the domain D 
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(1) Remove from D the parts D and the squares Q; have in com- 
mon. The remaining part of D is designated D, and the boundary 


of D, is designated Z,. The boundary L, consists of curves Lj and 
the line segments parallel to the coordi- 
nate axes. y 


(2) Denote by oF the part a square Q; 
and the domain) D, have in common. The 
domains Q; divide the domain D, into sin- 


gly connected parts D7, the boundary of 
each consisting of line segments parallel to 
the coordinate axes and of possibly one 
curvilinear segment contained in one of the 


curves L; and having alength smaller than 6. 0} x 


Since that curvilinear segment is 1-4 projec- 

ted onto one of the coordinate axes (the Fig. 7.9 

length of each of such segments is less 

than 6* <4 and in this case according to Corollary’ 1 of Lemma 14 


it is 4-4 projected onto one of the coordinate axes), any domain D; 
clearly can be divided with straight lines parallel to one of the coora- 
inate axes into a finite number of parts D;,, each a rectangle or a cur- 
vilinear trapezoid** possibly degenerated into curvilinear triangle. 

Figure 7.5, shows a domain D;, the dotted lines indicating a sub- 
division of D; into parts D,. 

7.1.6. Proof of theorem 7.{. We have just seen that after removing 
from D the parts that are in squares GQ; we obtain a domain D=** 
with boundary ZL, which can be divided into a finite number of 
domains D; of specia] kind. 


We prove that the Green formula is true for D,. According to the 
corollary of Section 7.1.3, to do this it is sufficient to show that each 
of D; is a type-&k domain with respect to some specially chosen 
Cartesian system. 

If D, is a rectangle, then the required system is for example a coor- 
dinate system one of whose axes is parallel to a diagonal of the rect- 
angle. Let D, be a curvilinear trapezoid or, a curvilinear triangle. 
From the way D;, are constructed it follows that the curved side of 


* A domain D is said to be singly connected if any piecewise smooth, not 
self-intersecting closed curve in D bounds a domain all points of which are in D. 
** Recall that a curvilinear trapezoid is a figure whose bases are parallel 
to one of the coordinate axes, one of whose lateral sides is parallel to the other 
coordinate axis and the curved lateral ide is 1-4 projected onto that axis. 


*** Recall that Q; arechosenforany given positive & so that the sum of their 
areas is less than ¢ and the sum of the lengths of the parts of the boundary L 


that are in Q; is also less than ¢. It is clear that as e + 0 the domains D, exhaust 
D. 
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the boundary of D; satisfies the hypotheses of Lemma f in Section 
7.4.4 and therefore, according to Corollary 2 of the lemma, is i-{ 
projected onto both coordinate axes of a specially chosen Cartesian 
system. Sinee slight changes in the chosen system do not violate 
the indicated property, we may clearly choose a coordinate system 
such that the rectilinear parts of the boundary of D,, aro also 41-4 
projected onto its hoth axes. With respect to that system D), is 
a type-K domain. So for D, the Green formula 

t | (-5 al 5—)dzdy=) Pdz+Qdy (7.10) 

€ Le 


is true. From the way domains D, are constructed it follows that 
as e—Q the Ileft- and right-hand sides of formula (7.10) have 


respectively the limits { { (= a” dxdy and { Pdr+Qdy. 


Ox “Ou 
LL 


The proof of Theorem 7.1 is complete. 


7.2, THE STOKES ° FORMULA 


7.2.4. Statement of the main theorem. Let § be a bounded complete 
piecewise smooth two-sided surface with piecewise smooth boundary 
pe *, 

A neighbourhood of S is any open set Q containing S§. 

The following main theorem is true. 

Theorem 7.3. Let functions P (x, y, 2), Q (x, y, s) and R (z, y, 2) 
be continuous and have continuous partial derivatives of the first order 
in some roca of a surface S. Then the following relation holds 


\}( (5-4 7 2) dy dz + 


oP @R\, , ,¢ 00 OP _ 

+(5--Sp) dedzr+(So~—S—)dedy = 

= Pdz+Qdy+Rdz (7.14) 
2 


called the Stokes formula. The integral on the right-hand stde is a 
sum of integrals along the connected components of Y on which a direc- 
tion of circulation is indicated such that with regard to the choice of 
surface side the surface S remains on the left. 

Using Romark 2 of Section 5.3.2 on tho form of notation for sur- 
face integrals of the second kind and symbols X, Y, Z for the angles 


* George Gabriel Stokes (4819-1903) is a well-known English physicist 
and mathematician. 
** Note that a closed surface hag no boundary. 
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the normal to the surface forms with the coordinate axes we can 
rewrite the Stokes formula (7.141) as follows: 


\) [ (Sp — Se) cos ¥ + (45) cos ¥ + 


+(32—-) cosZ |do= h Pde+ Qdy+ Raz, (7.12) 
Tr 


In the subsections that follow we shall prove a number of proposi- 
tions we shall need to prove the stated theorem. 

7.2.2. The proof of the Stokes formula for a smooth surface 1-1 pro- 
jected onto three coordinate planes. The following theorem is true. 

Theorem 7.4, Let S be a bounded complete smooth two-sided singly 
connected surface with piecewise smooth boundary TI. We assume S 
to be 1-1 projected onto each of the coordinate planes of a system Oxyz. 
Let P, Q, and R be functions given in some neighbourhood of S, con- 
tinuous in that neighbourhood and having in it continuous partial 
derivatives of the first order. Then the Stokes formula (7.11) is valid. 

Proof. To prove the theorem we turn to the form (7.12) of writing 
the Stokes formula. We shall assume the unit normal vectors to 
form acute angles with the coordinate axes. 

Obviously the theorem will be proved if we prove the equations 


\ J (Fe cos ¥—F— 0032) do= @ P de } 


\ (2 cos z—Peosx} do=§ Qay, | (7.13) 
r 


Oy Ox 


yo wn rm 


\ (32 cos X—2 cos ¥ } do= Raz. 
r 


Since relations (7.13) yield themselves to proofs of the same type, 
we shall prove only the first of them. 

We denote by J the integral on the left of the first of the equations 
(7.13): 


I= J (F—cos ¥ —-— cos Z) do. (7.14) 


Under the hypothesis the surface S is smooth and 1-41 projected 
onto the Oxy plane. Therefore § is the graph of the differentiable 
function z = z(z, y). In this case, with regard to the orientation 
of the unit normals to S, cos Y and cos Z can be found from the 
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formulas 
cos ¥ =, C08 Z = ae, (7.15) 
1+ p?~—9? Vo 1-> p?-- 9" 
-} __ Oe _. dz 
WHEE P= "TWrs I= ay 
Using formulas (7.415) relation (7.14) may be rewritten as 
follows: 


I= -jJ (+ oF) cos % ao. (7.16) 


Since on S the values of P (z, y, 2) are equal to P (z, y, = (Zz; Y)); 
by the indirect differentiation rule 


a ap , YaP 
ay [Pe uy, a(t, ING ta are 
Therefore relation (7.16) becomes 


J=— \ sy (P (2, y, z(x, y))] cos Z do. (7.47) 


Let D be the projection onto the Oxy plane of the surface S 
and let Z be the projection onto that plane of the boundary I 
of S. Obviously the surface integral on the right of (7.17) is equal 


to tho double integral | [ x [P (xz, y, 2 (2, y))] dz dy (see Remark 2 


a * 


D 
in Section 5.3.2) and therefore 
a r 
I= — \ Gr lP (ts ys a (zy) dz dy. (7.18) 
Applying to the integra] on the right of (7.48) the Green formula 
we get 
fad ry} n ~ 
\\e [P(z, y, 2(z, y)))dzdy= —& P(x, y, (2, y))dz. (7.19) 
D L 
Suppose a point Al (z, y, z) of I is projected into the point 
NV (z, y) of the curve LZ. Then clearly the value of the function 


P (x, y, 2) at the point AY of I coincides with the value of P (x, y, 
z(z, y)) at the point N of Z. Therefore 


6 Plz, y, a(x, y)idc=O P(z, y, 2)dz. (7.20) 
Li r 


Obviously relations (7.44), (7.18) to (7.20) yield the first of the 
equations (7.13). The second and third of these equations are proved 
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similarly, but it is necessary to consider the projections of S onto 
the Oyz and Ozz planes respectively. The proof of the theorem is 
complete. 

7.2.38. The invariant form of the Stokes formula. Let functions 
P (xz, y, 2), O (x, y, 2), and AR (az, y, z) be continuous and have 
continuous partial derivatives of the first order in some neighbourhood 
Q of a surface S. We define in Q a vector field p whose coordinates 
in a given Cartesian system are equal to P, Q, R. Obviously, under 
the conditions imposed on the functions P, Q, R the fild p is con- 
tinuous and differentiable in Q. We find the curl of p. Using the expres- 
sion for curl p in the orthonormal basis i, 7, k we get 


curl p= (SE — 20) 4 (EF (— AE) (7.29 


We choose on S a definite side, i.e. indicate on S a continuous field 
of unit normals m. Turning to the expression (7.24) for curl p and 
using the standard notation cos X, cos Y, cos Z for the coordinates 
of the unit normal vector n to S we get 


n curl p= (<2) cos X ++ (+ -=) cos Y + 
+(£- cm ) cos Z. (7.22) 


It follows from relation (7.22) that the integra] on the left of the 
Stokes formula (7.12) may be written as 


\) neurl p do. (7.23) 


So once a definite side of the surface is chosen the integral on the 
left side of formula (7.12) may be regarded as the surface integral 
of the first kind (7.23) of the function n curl p given on S. Since 
the scalar product p curl p and the area element do of S are indep- 
endent of the choice of Cartesian system in space, under a change 
to a new orthonormal basis it’, j', &’ the left-hand side of formula 
(7.42) remains unaltered in form and value, i.e. it is invariant 
under a choice of Cartesian system in space. 
Now consider the integral 


 Pdx+Qdy+ Raz (7.24) 
Tv 


on the right of the Stokes formula. 

We show that this integral also has: an invariant nature, its form 
and value remain unaffected under a transformation to a new Carte- 
sian system. 
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Let ¢ be a unit tangent vector at the points of the boundary [ 
of S whose direction agrees with the direction of circulation about LP, 
and let cos « cos fh, cos y be the coordinates of ¢. Choose the arc 
length / to be the parameter of IT, increases in the parameter on 
each connected component of f agreeing with the direction of circula- 
tion about that component. Under the conditions imposed on IF 
the function ¢ (1) is piecewise continuous. Since p is continuous 
on [, its coordinates are continuous functions of 2 on [. Note that 
once the direction of circulation and the parameter on [ are chosen 
the line integral of the second kind (7.24) transforms into a line 
integral of the first kind, with P, Qand R calculated at the points of 
Y and dt = cosa dl, dy = cos fi dl, dz = cosy dl. Thus 


§ Pdz+Qdy+Rdz= 
tT 


=~) 
No 


= (Pcosa+Qcosf-+ Reosy)di= ft ptdl, (7.29) 
F r 


Relations (7.25) show that the integral (7.24) has indeed an invariant 

nature: the scalar product pé is an invariant, parametrization with 

the aid of the arc length is independent of the coordinate system. 
In the new coordinate svstem Oz’y’z’ we have 


pt dl=(P' cosa’ + Q’ cosBf + Rf cos y’) dl = Pl dzf + 
AQ’ dyf + R' dz". 
Therefore 
Pdzx+Qdy+Rdz = P' dz’ +- QO" dy’ + R' dz’. 
Note that the integral 


b pt dl 
r 


is usually called the circulation of a vector field p about the curve I. 
The above reasoning allows the Stokes formula (7.11) (or (7.12)) 
to assume the following invariant form: 


| { nourltp do = ptdl. (7.26) 
“s r 


7.2.4. Proof of Theorem 7.3. We prove the following auxiliary 
statement. 

Lemma. Let S be a bounded complete two-sided smooth surface 
with piecewise smooth boundary [*, There is 6 > 0 such that any con- 


* Note that a closed surface haz no bounJary. 
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nected part of S whose size is less than 5* is 1-4, projected onto each 
of the coordinate planes of some Cartesian system. 

Proof. We first show that some neighbourhood of each point AT 
of such a surface is 1-1 projected onto each of the coordinate planes 
of some Cartesian system. 

Let mar be a unit normal vector to the surface at W/. Choose a Carte- 
sian system Oxyz so that my, makes acute angles with the Oz, Oy, 
and Oz axes. Then in that coordinate system the determinants } 


Yu Zy tu Yu 
9 
Yo 2p Ly Yo 


are Clearly nonzero for the values of uw and v determining the point M, 
and by the smoothness of S they are nonzero in some neighbourhood 
of the point (uw, v) (these determinants are proportional to the coord- 
inates of the unit normal vector to the surface). Turning to the proof 
of Theorem 5.1 and to the remark to the theorem (see Section 5.1.2) 
wesee that some neighbourhood of the point is 1-1 projected onto 
each of the coordinate planes of the chosen coordinate system Ozyz. 

Assume that the statement of the lemma is false. Then for every 
§, =i/n, n =1, 2, ... we can find a part S, of S whose size 
is less than 6, and which is not 1-1 projected onto the three coord- 
inate planes of any Cartesian system. Choose in every part S, a point 
M,, and then choose a subsequence of the sequence {//,} converging 
to some point M of S. Consider the neighbourhood of MM that is 
1-1 projected onto each of the coordinate planes of some Cartesian 
system Oxyz. The neighbourhood contains one of the parts S, that 
is also 1-1 projected onto the three coordinate planes of the system 
Oryz. But this contradicts the choice of parts S,. Thus the assump- 
tion that the statement of the lemma is false leads to contradiction. 
This proves the lemma. 

Now we proceed to prove Theorem 7.3. Divide S with piecewise 
smooth curves into a finite number of smooth parts S,, each of the 
size less than 6 indicated in the lemma we have just proved. Add to 
the curves dividing S the edges of the surface. Since S; is 1-1 projected 
onto the three coordinate planes of some Cartesian system, by the 
invariance of the Stokes formula (see Section 7.2.3) and the conclu- 
sions of Section 7.2.2 the Stokes formula is true for a part S;. Now 
sum the left- and right-hand sides of the Stokes formula for the parts 
S;. Obviously the sum of the left-hand sides of the formulas is the 


double integral \ \ n curl p do and the right-hand side is the sum 
S 


Zy ty 


Zy Xp 


of the integrals p pt dl along the boundaries I; of S;. It is clear 
Ry; 


* Such a part of the surface may be in a sphere of radius 6, 
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that the integrals along the common parts of the boundaries of 
S; cance) out for they are traced in opposite directions (consider 
Fig. 7.6 for explanation). Therefore the above sum of line integrals 


is equal to the line integral along the boundary I of the surface S. 
Our reasoning implies the validity of the formula 


\ | 2 curl pda = pi dl 
° r 


§ 
which is just the Stokes formula. The proof of the theorem iscomplete 


7.3. THE OSTROGRADSKY FORMULA 


7.3.1. Statement of the main theorem. Let V be a finite, in general, 
multiply connected, domain with piecewise smooth boundary S* 
in the Oxys space. The domain ¥ with adjoined boundary will be 
designated VV. The following main theorem is true. 

Theorem 7.5. Let functions P (z, y, 2), Q (x, y, 2), and R (z, y,=) 
be continuous in V and have continuous partial derivatives of the first 
order in VY. If there are improper integrals of each of the partial deriva- 
tives of P, Q, and R over V, we have the relation 


\\) (s+ 4+) dz dy dz= 


=|( Pdyds+Qdsdz+Rdzdy (7.27) 
= 


* The boundary S is said to be piecewise smooth if it is made up of a finite 
number of smooth surfaces adjoining one another along smooth curves, the 
surface edges. If S consists of a finite number of closed piecewise smooth sur- 
faces S;, then S; are said to be connected components of § and the connected do- 
main V is said to be multiply connected. 
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known as the Ostrogradsky formula. The integral on the right of it is 
the sum of the integrals along the connected components of S on which 
the side exterior to V is chosen. 

We confine ourselves to proving the Ostrogradsky formula for 
only a special class of domains. 

Note that Theorem 7.5 can be proved by extending the method 
used in Section 7.4 to prove the Green formula. 

7.3.2. The proof of the Ostrogradsky formula for a special class of 
domains. A singly connected finite domain V is said to be a type- K 
domain if every straight line parallel 
to any coordinate axis intersects the 
boundary S of Vin at most two points. 

For the type-K domain special sys- 
tems of exhaustive domains {V,} are 
used. We now describe how such sys- 
tems are constructed. 

Let a domain D in the Ozy plane be 
the projection of a domain V onto that 
plane. Draw through the boundary 
points of D straight lines parallel to 
the Oz axis. Each of the straight lines 
intersects the boundary S of V in one 
point only. The set of these points divi- 
des S into two parts, S’ and S’, 
(Fig. 7.7) that are the graphs of the Fig. 7.7 
functions z, (z, y) and Z, (x, y) conti- 
nuous in D and piecewise differentiable in D. Note that 2, (x, y)< 
<q (x, y) (equality holding only at the points of the boundary of D). 

Consider an arbitrary sequence of domains {D,} monotonically 
exhausting D. Let S; and S;, be the graphs of the functions z, (x, y) -+ 


+, and 2, (z, y) — & given on D,, (€, is chosen to be so small 
that the surfaces S, and S, do not intersect). 


The boundary of V, is the surface made up of the surfaces Sp 
and S};, and part of a cylinder with generators parallel to the Oz 


axis, the boundary of a domain D, serving as the directrix of the 
cylinder. The domain Va+i iS constructed similarly, but instead 
of D, we take the domain D,4, and choose €,4, to be less than é,. 
It is obvious that as €, 0 the system { V,} monotonically exhausts 
the domain V. 

We prove the following statement. 

Theorem 7.6. In a type-K domain V, let functions P (x, y, 2), 
O(z, y, 2), and R (zx, y, 2) satisfy the hypotheses of Theorem 7.5. 
Then for that domain and for the functions P, Q, and R the Ostrogradsky 
formula is valid. 
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Proof. Clearly it suffices to show the validity of the equations 


(\{ Sh dzdyas= || Payas, 


J Ox 


22 az dyds= { { QOdzdr, (7.28) 


OY 
& 


| Sp dzdy d= \ \ Rdzdy. 


the 
S 


Since their proofs are of the same type we prove the third equation. 
Consider the triple integral 


| | SPaeay as. (7.29) 
Tn 
For the domain Y,, and for the integrand 0R/dzin the integral (7.29) 


all the conditions hold under which the repeated integration for- 
mula is valid. From this formula we have 


Z3(X_ N)~-En 


| | | Shardyaz= || azdy [ Stace 


Yn Dn 2x(x, y)ten 

= | R(x, Ys 2g (Z, Y)— En) dx dy— 
= 

— f \ R(x, y, 24(z, y)+e£,)) dx dy. (7.30) 
Dn 


The Jeft-hand side of relation (7.30) as n->0 has a limit equal 
to { { on de dydz. By the uniform continuity of the function 


t 

R (x,y, 2) in the closed domain V each of the terms on the light 

of (7.30) has as n—~oo a limit equal to { | R(x, Y, 22 (x, y)) dz dy 
“p 

for the first term and to { | R(x, y, 2, (x, y)) dx dy for the second. 


D 
The former of the integrals is (if the exterior side of S is chosen) 
the integral { [ R(x, y, 2) dx dy and the latter (taking into account 
“se 
the “minus” sign preceding it) is the integral R(x, y, 2) dzdy. 
*s? 
So the right-hand side of relations (7.30) las as n-- co a limit 
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equal to | { Re, y, z)dxdy. Consequently the third of the for- 


mulas (7.28) is proved. 

The first and the second formulas (7.28) are proved similarly (it 
is necessary to consider the projections of V onto the Oyz and Ozz 
planes respectively and repeat the reasoning). The proof of the theorem 
is complete. 

7.3.3. The invariant form of the Ostrogradsky formula. Let func- 
tions P, Q, and R satisfy the hypotheses of Theorem 7.5 in a finite 
connected domain V with piecewise smooth boundary S. We define 
in V a vector field p whose coordinates in a given Cartesian system 
Oxyz are equal to P, Q, R. Under the conditions imposed on these 


functions the field p is clearly continuous in V and differentiable 
in V. 

We find the divergence of p. Using the ex pression for the diverg- 
ence of p in an orthonormal! basis i, j, k, we get 


a 


diy’ p= —— - one 

Remark. We transform to a new Cartesian system in space. 
Let i’, j’, ke’ be an orthonormal basis referred to that system and 
let P’, QO’, R’ he the coordinates of p in that basis. Obviously the 


functions P’, Q’, R’ are continuous in V and differentiable in V 
(they are linear combinations of the functions P, Q, R). 
Since in the new coordinate system 
G c 
div p =<, Sn ae z + Jc 


\ 


by aaa invariance 
aP' , a0’ , aR’ 
tt 2p a tte 


Thus, if P, Q, R are considere as the coordinates of the vector 
feld p, the expression < = OO 4 OR — remains unaltered in both 


form and value under a transformation to a new Cartesian system, 
i.e. 1S an invariant. 

We may therefore make the following important conclusion: 
the integral on the left of the Ostrogradsky formula (7.27) has an invar- 
iant nature, its form and value remaining unaltered under a transforma- 
tion to a new Cartesian system. Indeed, under such a transformation 
of coordinates the absolute value of the Jacobian is equal to unity. 
But according to the remark the integrand remains unaffected in 
both form and value under a transformation of coordinates. 
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We now look at the integral 


| | Pdyds+Qdzdr+ Rdzdy (7.31) 
*§ 


on the right of the Ostrogradsky formula (7.27). We shall show that 
the integral also has an invariant nature, its form and the value of 
the integrand remaining unaffected under a transformation to a new 
Cartesian system. 

Using Remark 2 of Section 5.3.2 on the notation of the surface 
integral of the second kind and symbols X, Y, Z for, the angles 
the normal n to the surface forms with the coordinate axes we may 
rewrite the integral (7.31) as 


| (PcosX+Qcos Y¥Y + Reos Z) do. (7.32) 
me 


The integrand in the integral (7.32) is a scalar product np and the 
integral (7.32) (or equivalently the integral (7.31)) may therefore 
be written in the following invariant form: 


[Jno 


Note that this last integral is generally called the flux of the vector 
field p through the surface S. 

Turning to the invariant notation of the integral (7.31) we sce 
that in the new Cartesian system the integral has the form 


\ P’ dy’ ds' +0! dz dzr' +R dz' dy’. 
“s 


Our arguments here allow the Ostrogradsky formula (7.31) to 
be written in the following invariant form: 


| \ \ div pdu = { \ npdao, (7.33) 
Vv Ss 


where dv denotes the volume clement of V. 

From Theorem 7.6 and conclusions of this subsection we may draw 
an important corollary. 

Corollary. Let functions P (x, y, 2), 0 (z, y, 2), and R (z, y, 2) 
satisfy the hypotheses of Theorem 7.5 in a finite domain V with piecewise 
smooth boundary S. If V may be divided intoa finite number of domains 
V,, with piecewise smooth boundaries §;, each of V;, a type-K domain 
referred to some Cartesian system, then for V and P, Q, and R the 
Ostrogradshy formula is true. 

The validity of the corollary is obvious from the following 
reasoning. It is clear that the Ostrogradsky formula is true for 
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each of the domains V,. This follows from the invariant nature 
of the formula and from Theorem 7.6 (in some coordinate system V’;, 
is a type-K domain). Further, it is obvious that the sum of the 
integrals | | | (= £2 | “} dx dy dz from the left-hand sides 
g J) | Uae 7 ay Gs y 
h 
of the Ostrogradsky formulas for the domains V; is the integral 
)  { (= +344) dxdy dz. But the sum of the surface inte- 


v 
grals \ P dy dz+Qdzdzx+ Rdzdy on the right-hand sides of the 
S 


k 
Ostrogradsky formulas along the boundaries S, of V; gives 
\ \ Pdydz+Qdzdz+Rdz dy, for the integrals along the common 


parts of the boundaries of V; cance] out, the parts in adjacent 
domains V; being opposite in orientation. 


7.4. SOME APPLICATIONS OF THE FORMULAS OF GREEN, 
STOKES AND OSTROGRADSKY 


7.4.1. Expressing the area of a plane domain aS a line integral. 
Let D be a finite plane connected domain with piecewise smooth 
boundary L. The following statement is true. 

The area o of D may be calculated from the formula 


c= a Gady—y dx (7.34) 


where the line integral is a sum of integrals along the connected com- 
ponents of L, a direction of circulation being indicated on each of the 
components such that D remains on the left. 

To prove the statement, consider in D the functions 

P(t, y)=—y, QO, y) ==. 


These clearly satisfy in D all conditions under which the Green 
formula (7.1) is true. From this formula we have 


\ (42 —-2 | dady=9 (—y) dx-+ (zx) dy. 


The double integral in the last formula equals 20 and the line integral 
equals & 2 dy — y dx. Thus formula (7.34) is proved. 


L 
7.4.2. Expressing volume as a Surface integral. Let V be a finite 

connected domain in space with piecewise smooth boundary S. 
The following statement is true. -- 


13—01684 
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The volume v of V can be calculated from the formula 


| 
v= 


< | ( xdydzt+ydsdz--sdzrdy (7.35) 


s 


in which the surface integral is a sum of integrals along the connected 
components of S, the side exterior to V being chosen on each of the com- 
ponents. 
To prove the statement, consider in V the functions 
P(z,yz)=27, OGY 2=y, Ry, y, 2) = 2. 


These clearly satisfy the conditions under which the Ostrogradsky 
formula is truc. By this formula 


al ( 21) 4 Sw) 26) ) dx dy dz= 
‘ ¥ 


ax ou CO: 


=| xdydz-+-ydsdz+-cdxrdy. 


The triple integral in the Jast formula equals 3v. Therefore the Jast 
formula implies relation (7.35). This proves the statement. 

7.4.3. Condilions under which the differential form P (x, y) dx +- 
+ Q (a, y) dy isa total differential. Here we show a number of con- 
ditions under which the differentiable form P (x, y) dx + Q (x, y) dy 
given in the connected domain D is a total differential of some func- 
tion wu (x, y). 

We prove the following theorem. 

Theorem 7.7. Let functions P (xz, y) and Q (x, y) be continuous 
in a domain D. Then the following three conditions are equivalent. 
; 1. For any closed (possibly self-intersecting) piecewise smooth curve 
L in D 


) Pdz+Qdy=0. 
L 


2. For any two points A and B of D the value of the integral 
\ Pdz+Qdy 


AB 
is independent of the piecewise smooth curve AB Joining the points 
A and B and lying in D. 

3. The differential form P (x, y) dz 4+- Q (x, y) dy is a total differ- 


ential. In other words, a function u (Al) = u (zx, y) is given in D 
such that 


du= Pdxr+Q dy. (7.30) 
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In this case, for any points A and B in D and for an arbitrary piece- 
wise smooth curve AB joining those points and lying in D 


\ P dr4-Qdy =u(B)—u(A). (7.37) 
s , 
Thus, fulfilling each of conditions 1, 2, 3 is necessary and sufficient 


for either of the other two to be fulfilled. 
Proof. We prove the theorem using the diagram, i.e. prove that 


CY 


the first condition implies the second, the second implies the third, 
and the third implies the first. It is obvious that this will prove the 
equivalence of conditions 1, 2, 3. 

First step: 4 +2. . Let A and B be arbitrary fixed points in D, 


and let ACB and AC’ B be any two piecewise smooth curves joining 
those points and lying in D (Fig. 7.8). The sum 8 
of the curves is a piecewise smooth (possibly 


self-intersecting) closed curve 1 = ACB -- C f 
BC'A in D. Since condition 1 is assumed fulfil- 
led, we have y 
) Pdx-+Q dy=0. Fig. 7.8 
L 


From this, considering that L = ACB + BC'A and that under 
a change of the direction of circulation the line integral changes 
sign, we get the relation 


[ Pdx+Qdy= \ Pdz+Qdy. 
ACB AC’B 
Consequently condition 2 holds. 
Second step: 2 +3. Let Af, be a fixed point, let AY (xz, y) be an 


arbitrary point of D, and let AJ,AJ be any piecewise smooth curve 
in D joining the points AJ, and Af. 
By condition 2 the expression 
u(M)= | Pdz+Qdy (7.38) 
Mya 
13% 
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is independent of wJ/)4/ and is therefore a function given in D. 
We prove that at cach point 47 of D there are partial derivatives 


Ou OU . 
A rs and iy? with 


te 


5 
—— = P (z, y), ——=( (z, u). 


(7.39) 


Since P (z, y) and Q (zx, y) are conti- 
nuous in D, tho last relations imply 

G| x the differontiability of u and equation 

Fig. 7.9 (7.36). This will prove the second step, 
2 —>3. 

The existence of partial derivatives off u (a, y) is proved simul- 
taneously with equations (7.39). We prove, for instance, the existence 
of du/dx and the first of the equations (7.39). Choose a point AF (2, y). 
Assign tojthe independent variable z an increment Az small enough 
for the segment AYN joining Jf (z, y) and N (a +- Az, y) to lie 
in D* (Fig. 7.9). We have 


Aursu(x--Ax, y)—u(z, y= 
_ | Pdz-+-Qdy— ( Pdz+Qdy= \ Pdz+Q dy. 


wee” ee” 
MoMN | MoM 


On JIN the value of y is constant and therefore | Ody = Q. Con- 
AN 
sequently 
x+Ax 


Au = | Pde = [ P(t, y) dl. 


MAN x 
Applying to the last integral the mean value theorem we get 
Au = P (x + OAz, y) Az, where 0 <0 <1, 
whence 


MC = P(r4-0Az, vy), O<O<4, 

By the continuity of P(az, y) the right-hand“side of the last 
equation hag a limit as Az-»0 equal to the value of this function 
at Af (x,y). Consequently, the left-hand sido too has the same 
limit equal by definition to — . The existence of a partial deriv- 

* Since D is a domain, i.c. a set consisting of interior points only, such a 
choice of Ax is possible. 
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ative and the validity of the first of the equations (7.39) are 
thus proved. The existence of a partial derivative 2— and the 


validity of the second of the equations (7.39) are proved 
similarly. 

We now prove relation (7.37). Let A and 8 be any points in D, and 
let AB be an arbitrary piecewise smooth curve joining these points 
and lying in D. The curve is defined by the parametric equations 
x=x(t) y=y(t), axt<b. Using the rule for evaluating 
line integrals we get 

b 
| Paz+Q dy = | {P (x(t), y(t)) 2’ (t)+Q (z(t), y@)) y’ ()} dt = 


a 


ba ( 


b 
= \ u;dt=u(x(b), y(b))—u(z(a), y(a))=u(B)—u(A). 


Formula (7.37) is thus proved. 

Third step: 3 +1. This statement follows from formula (7.37). 
Indeed, for a closed curve JZ the end points coincide and therefore 
by formula (7.37) 


§ Pdz+Qdy=u(A)—u(A)=0. 
L 


The proof of the theorem is complete. 
Remark. We noted that conditions 1, 2, 3 of Theorem 7.7 are 
equivalent and therefore, in particular, condition 3 is a necessary 


and sufficient condition under which the line integral { Pdx+ Ody 
L 


is independent of the choice of curve LZ joining any given points 
A and B in a domain D. 

For singly connected domains* we shall show a necessary and suf- 
ficient condition, convenient for applications, that the differential 
form P dz -+ Q dy should be a total differential of some function. 


Naturally this condition is necessary and sufficient for Pdz-+- 
+-Q dy to be independent of the choice of curve Z joining any 
given points A and & in D. 

Theorem 7.8. Let functions P (xz, y) and Q(z, y) and their deriv- 
atives be continuous in a singly connected domain D. Then each 
of the three conditions 1, 2,3 of Theorem 7.7 is equivalent to 


* Recall that a domain D is said to be singly connected if any piecewise 
smooth, not self-intersecting, closed curve in D bounds a domain all of whose 
points are in D. 


495 Fundamentals of Mathematical Analysis 


condition 
ap on, 
4A, ——= —~ in D. 
OU cx 


Proof. We use the diagram 
] 
a 
7 


We have already proved the statements 1-2—3. Now we prove 
that 34 and 4—1. 

First step: 3— 4, Let there exist a function u(z, y) in D such 

du Ou 
—_ an een —_—_- — 

that du= P dz+-Qady. Then [az = P, 7 Q 

OP | = 0 du\_ dQ 

dy” Oy ( Ox =—(=)= ox ° 

Thus condition 4 holds. Note that proving the step 3 —4 does 
not require D to be singly connected. 

Second step: 4 +41. Let condition 4 
hold. Then at each point of D 


—-— —- = 0), (7.40) 


If LZ is a closed piecewise smooth 
curve without self-intersection lying in D 
and bounding a domain D* (D is singly 

Fig. 7.40 connected and therefore each point of 
D* isin D), then applying the Green 
formula to D* and using (7.40) we get 


b Pdz-+Qdy= \ \ (<2 ~=—) ardy=0. 
L D* 


az dau 


When L has a finite number of points of self-intersection or is 
a broken line with a finite number of components, & Pdzx+Qdy=0 


ry 
for each loop Z of L and therefore p Pdz+Qdy=0 for L. 


L 

Let ZL be an arbitrary closed piecewise smooth curve. Choose for L 
a number 7 > 0 in the way indicated in Lemma 1. Divide Z into 
components L, of length less than 4 (the corner points of J are also 
among the division points, see Fig. 7.10). According to Lemma ! 
the tangents at the end points 1/7, and .V, of each of the components 
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I, make an angle Jess than n/8. Then obviously for a sufficiently 
small A the curvilinear triangle M),N;C;, (the shaded triangle in 
Fig. 7.10), in which 17,C, makes with the tangent at J/7, an angle 
less than 1/8 and N,,C, is the normal to LZ at N,, is entirely in D 
and is a closed piecewise smooth curve without self-intersections. 
Therefore 


P dx+-Q dy =0. 
My,N},Cp 


It follows that the line integral along the arc M,N » is equal to 
the line integral along the broken line M;,C,,Nx: 


\ Pdz+Qdy= \ Pdzx+Q dy. 


a M,C,N» 


Similarly for any component L, we obtain in D a closed broken 
line Z for which 


§ Pdz+Qdy=@ Pdx+Qdy. (7.41) 
4 L 


We noted above that for a closed broken line Z in D the integral 
\P dx + Q dy =0. From this and from (7.41) we get 
L 


& Pdx+@ dy =0. 
L 


The proof of the theorem is complete. 

7.4.4. Potential and solenoidal vector fields. Earlier (see Sec- 
tions 7.1.3, 7.2.3, and 7.3.3) we introduced the concepts of circula- 
tion and flux of a vector field. We recall these notions. 

Let a continuous vector field p (17) = p(z, y, z) be given in 
some domain D. 

Definition 1. The circulation of a vector field p about a closed piece- 
wise smooth curve L lying in D is the integral 


. pi dl, 
L 


where t is a unit tangent vector to L and dl is the differential of the 
are length of L. 
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Definition 2. The flux of a vector field p through an oriented piece- 
wise smooth surface S in D is the integral 


| { pr do, 


. s 


S 


where nis a unit normal vector to S indicating the orientation of S 
and do is the area element of S. 

We introduce the concepts of potential and solenoidal vector fields. 

Definition 3. A vector field p is said to be potential in D if,the circu- 
lation of the field about any closed piecewise smooth curve in D is cero. 

Definitiond. A vector field p is said to be solenoidal in D if the 
flux of p through any piecewise smooth, not self-intersecting closed 
surface in D that is the boundary of some bounded subdomain of D is 
zero. 

For continuous differentiable vector fields and a special class 
of domains we shall prove a theorem containing necessary and suffi- 
cient conditions for a field to be potential. 

As a preliminary we shall introduce the concept of three-dimensional 
singly-connected-surface domain or SCS domain. 

A three-dimensional domain D is said to be a singly-connected- 
surface domain (or SCS domain) if for any piecewise smooth closed 
curve £ in D we can find an orientable piecewise smooth surface 
S in D such that its boundary is LZ. Note that for that surface § 
the Stokes formula is true. 

The following theorem holds. 

Theorem 7.9. Let a continuously differentiable vector field p = 
= {P, Q, R} be given in a SCS domain D. Then the following three 
conditions are equivalent: 

4. The vector field p = p (MM) is potential. 

2. In D there is a potential function u (M4), i.e. a function such that 
p = grad u or equivalently 


du= Pdzr+Qdy+ Radz. 


In this case for any points A and B in D and jor an arbitrary piecewise 
smooth curve AB joining these points and lying in D 


[ pt di=u(b)—u(A), 


wae 


AB 
where tis a unit tangent vector to AB and dl is the differential of an arc. 
3. The vector field p = p (MM) is irrotational, i.e. curl p = 0 in D. 
Condition 3 is obviously equivalent to the relations 
ap _9Q) 20 _ aR aR _ AP. 
ay oz’? ds Oy’ ox dz ° 
Either of conditions 2 and 3 is thus a necessary and suffictent condi- 
tion for the differentiable vector field p to be potential. 
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Proof. We use the diagram 


C) 


The statements 1—>2 and 2—-3 are true without assuming D 
to be a SCS domain and their proofs are quite similar to those of 
the corresponding statements of Theorems 7.7 and 7.8. 

We prove the statement 3 —>1. 

Let Z be a piecewise smooth closed curve in D. Under the hypothe- 
sis D is a SCS domain. There is therefore a piecewise smooth surface 
S in D such that its boundary is Z. By the Stokes formula (7.26) 


9 pedi = | | neurl pdo. 


L 
From this and from the condition curl p = 0 we get 
b ptdl=0, 
L 


i.e. the field p is potential. The proof of the theorem is complete. 

In conclusion we prove the theorem on necessary and sufficient 
conditions for a vector field in the so-called singly-connected-volume 
domains (or SCV domains) to be solenoidal. A spatial domain D 
is said to be a singly-connected-volume domain or SCV domain if 
any piecewise smooth, not self-intersecting closed orientable surface 
in D is the boundary of a domain that is also in D. 

Theorem 7.10. For a continuously differentiable vector field p 
to be solenoidal in a SCV domain D it is necessary and sufficient that 
at every point of D 

div p = 0. 

Proof. (1) Necessity. Let 7 be an arbitrary point of D. Consider 
any sphere S with centre at AZ lying entirely in D. Applying to the 
ball Dg with boundary S the Ostrogradsky formula (7.33) we get 


| | | divpav= J { npac. (7.42) 
" Dg “Ss 
Since the field p is solenoidal, { \ np do = 0 and therefore by (7.42} 
“Ss 
{ { { div pdv = 0. Applying to the last integral the mean value 
“ae 


theorem we see that at some point of Ds div p = 0. By virtue of 
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the arbitrariness of the ball and the continuity of the field p this 
implies that div p vanishes at JJ. The necessity of the hypotheses 
‘of the theorem is thus proved. 

(2) Sufficiency. Let S be any closed, piecewise smooth, not self- 
intersecting, orientable surface in D. Since D is a SCV domain, S is 
the boundary of the domain Dg that is also in D. Applying to Ds 
and to p the Ostrogradsky formula (7.33) we obtain the relation 


[ | npdo =0. 


@ e 


~ 


‘Since S is an arbitrary piecewise smooth, not self-intersecting closed 
orientable surface in D, the last equation implies by definition that 
the field p is solenoidal in D. The proof of the theorem is complete. 


SUPPLEMENT 
DIFFERENTIAL FORMS IN EUCLIDEAN SPACE 


75.1. Skew-Symmefric Multilinear Forms 


78.1.4. Linear forms. Let V be an arbitrary n-dimensional vector space whose 
‘elements are designated &, yn, .... We shall study functions associating with 
each element & € V some real number. 

Definition J. A function a (&) ts said to be alinearform if for any & € V, 
NE Vand any real number + 

(1) a@(§ + m) = a (8) + a (n), 

(2) a (24.5) = Aa (8). 

Definition 2. A sum of two linear forms a and b ts a linear form assoctating 
with each vector 5€ Vo a aumber 

e (8) = a (§) +- b (8). 


A product of a linear form a by a real number + is a linear form 8 assoclating 
with each rector &€ Vo a number 


b (6) = da (8). 

The set of all linear forms thus forms a vector space which we designate 
L, (V)*. We find the representation of the linear form ain some basis {e;}Jeay- 
Let 


nh 
- “Ys 
c=“) Sley, 
{-- 
where the numbers 2! are determined uniquely. Denoting a, = a (e,), the desired 
tTepresentation has the form 
Tt 
~ % we 
a(’)= \ Stay. 
iz-{ 


* The space L (¥) is also designated V* and called conjugate (or dual) to V. 
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: : inecar L(V) is n. 

; ; he dimensionality dim L(V) of a linear space a 

he Pit per iee ‘0 find some basis in L(V) containing exactly n elements, 
ae . linear forms. Choose an arbitrary basis {e,} of the space F a 


the following linear forms: 
eh (@) = ER (kA=1, 2, eats n), 


i i ith respect to the 
h) are the coefficients of the expansion of the vector E wi 
i: ihe basis len}. In other words a linear form e* acts on the ements of 
@ 


the basis {e;} by the 


4 when i=k 
eh (1) = Sin = { O when {= k. 


In such a case in the given basis (e;} the linear form a is of the form 


n 


a(t)= S\ aye! (&), a; =a (e1), 
iax{ 


i.e. the linear forms e! (&), e? (&), . . ., e® (&) form a basis in L (V). This basis is 
called conjugate (or reciprocal or dual) to the basis {e;}. . 
78.1.2. Bilinear forms. Denote by x V a set of all ordered pairs (§, $9), 
where ¢, € V, & € V, and consider the functions a Gr: €.) associating with each 
element of ! X V (i.e. each{two elements &, € V and & € V) some real number. 
Definition. A function a (&, &,) is said to be a bilinear form if a? each fixed 
value of one independent variable it is a linear form in the other. 


In other words, for any vectors &,, §, 1, No and any real numbers /4, As, 
“1 }ly 


0 (4481 > HaMs Aeb2 + Male) = Ayroe (Er, &2) + Aypea (1, Ne) + 
T MrAsa (Mh, 2) + Hatter (Ms M2). 


nd of all bilinear forms is easy to convert into a linear space by introducin 
aah ina natural way the operations of addition and multiplication by a rea 
i e resulting space of bilinear forms is eH arn L, (V). 


e Gnd the representation of the bilinear form a (E,, &,) in some basis {e;}Puy 

n 

of the space V. Let E, = 2 Ene; k= 1, 2. We puta (e,, €;) = ayy to 
P jus 
obtain the desired representation 


rs - S 7 
Gr &) = SOS ap ied. 
iui jom{ 
T ° é P 
me - fostermine the dimensionality of the s 


pelreaaietaar ace L(V 
bilinear forms: (§) constituting in F 2 (V) we form by means of 


L (V) a basis conjugate to (e,} the following 
Bis Se) =el (E,) ef (B,). 


‘ - 
an arbitrary bilinear form can be uniquely represented as 


ac = n | . 
1 ssl pa az sels (E,, $5). 


This 
the a:> Means that ms ef rm 
“imensionality fn ‘s Gr E2) form a basis in L, (V) and consequently 
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78.1.3, Multilinear forms. Let p bea natural number. We use VP = VX VX 
vo... M Vito denote the set of all ordered collections (2), 2, .. .. $2) of p ver- 
lors, each Tying in V, and consider the functions associating with each of such 
collections some real number. 

Definition. A function a (8. So... +. Sp) is said to be a multilinear form of 
degree p (or p-form) if it is a Utnear form in each of the independent variables with 
the valucs of the rest fixed. 

Introducing linear operations into the set of all p-forms we obtain a Jinear 
space which we designate L,, (V). 

We find the representation of an arbitrary multilinear form a (3. $2, .. .. =p) 
in some basis {e)}P, of V. We denote 


Vis f> vty =F (Cj, zo oaey ei) 


Hence if &, = “je; k= 41, 2, ..., py then 


- - 


i 


< < tye i 
G(Eq, Say eee, Ep) = » see » Viste. sipSl 2 6 EDP. 


If e® (€) is a basis in L (V) conjugate to {e,}, then obviously the p-forms 


dyin. .d . e a {* i © 
e P (Eq, Say woes Ep) ett Es) c7? (Ee)... © P (Ep) 
form a basis in Ly (V) and thus Z,, (V) is of dimension nP. 


75.1.4. pSkew-symmetric multilinear forms. 

Definition. A multilinear form a (€,, 6a,-.~, Ep) is said to be skew-symmetric 
if under a permutatton of any tuo (ndependent variables it changes sign*. In other 
words 


a (1, S24 ee eg Sis ao @ a Gjs ee #4 Sp) = —Q (G1) Qo, ae og ofr oe ef mie o 2 oF Spe 


Obviously, a set of all multilinear skew-symmetric forms of degree p form 
a subspace of Ly (1) which we designate A, (V)**. The elements of A, (V) are 
designated w = a (&), &s, .- -s Sp). 

Notice that if {e;} is an arbitrary basis in V and 


rt n 
— NN’ N1 of { 
@ —_ eos a! qy {pet eee & p? 


then the numbers Ol gtp change sign under a permutation of two indices. This 
follows from the fact that 


Oi tp KOC wees ei, 


It is natural to consider that A, (V¥) = L, (V) and that 4, (WV) consists of 
all constants, i.e. coincides with the number line. 

78.1.5. External product of skew-symmetric forms. Consider two skew-syin- 
metric forms wP € A, (V) and w7 € «lg (V'). We introduce here a basic operation 
in the theory of skew-symmetric forms, the operation of external multiplication, 


* Skew-symmetric multilinear forins are also called antisymmetric, skew, oF 
extertor. ; 
** This space is also designated APV* and called the pth exterior degree of V*. 
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Let 


OP=OP (Nir Nor -++ Np), WEV, 
@T= 0T(Ey, Coy seer Gq), SEV. 
Consider the following multilinear form a = Dp4g (V): 


a (§), Sn eves Epaq) = OP (Sip ses Ep)-w4 (Spar eee, Epeq)s (7.43) 


This in general is not skew-symmetric. That is, under a permutation of &; 
and §, where i< i<p and p+i<j<ap-+4q, the form (7.43) may not 
change sign. It is this that necessitates the introduction of external product. 
To introduce external product we shall need some facts from permutation 
theory. 

Recall that a permutation of numbers {1, 2,..., m} is the function o = 0 (1) 
defined on these numbers and mapping them in a one-to-one manner onto itself. 


The set of all such permutations is designed Sm Obviously there are in all m! 


different permutations in S’,,. For two permutations o € )),, and 1 € dm 


a superposition ot € > im is uniquely defined. The permutation'c™ is the inverse 
of ojif o-3¢g = aa-1 = e, where e is an identity permutation {i.e. © (k) = k, 
k= 141, 2,..., m). 

A * rutstios o is said to he a transposition if it interchanges two numbers, 
leaving the others fixed. In other words, there is a pair of numbers ¢ and j 
G<igm {<j<m, ij) such that o (@) =j, o(/) =i, and o(k) =k 
fork = iand k # j. Clearly, if o is a transposition, then o-? = o and o-o = &. 

It is known that any permutation o can be represented as a superposition of 
transpositions interchanging adjacent numbers, the parity of the number of 
transpositions in such a decomposition being independent of the choice of de- 
composition and is called the parity of the permutation o. 

We introduce the following notation: 


sgn o={ 1, if o is even, 
—i1, if o is odd. 

Notice that a form a € Ly, (V) belongs to Ap (V) if for any permutation 
o€N 

@ (Sot1)1 Sota)y o-+9 Sa(p)) = SEN Ga (Ey, Ber ee Ende 

Consider again the multilinear form (7.43). For any permutation o ef >) 
we set 

0a (61, coey E peg) = (Bot1)s ooos Ea( p4q))> (7.44) 

It is easily seen that ift€ vie+a and ao € Seto then (tc) a = t (ca). 


We introduce the following definition. 
Definition. The external product of a form w? € Ap (V) and a form w€ Ag (V) 
is a form © € Ansa (V) defined by the equation 


B+ 


@(E1, -.+1 Epeq) = >} sgn G-0a, (7.45) 

oO 
where the sum is taken over all permutations o € > ptq satisfying the condition 
o(ij)<o(2)<...<0(p), o~t+ip<---<o(p+q) (7.46) 


and the value of oa is given by equations (7.43) and (7.44). 
The external product of w? and w7 is designate 


o= wP A of 
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By way of illustration, consider the action of a permutation o sati-fying 
condition (7.46). Suppose two columns of vehicles are moving in parallel along 
some road, the first column consisting of p vehicles and the second of g vehicles. 
After some time the road becomes narrower and the two columns rearrange inte 
one column without stopping. The vehicles of the first column eccupy plices 
somewhere among the vehicles of the second, but the sequence of vehicles within 
either column remains the same. Asa result, we obtain a permutation satisfying 
condition (7.46). It is easy to see that, conversely, any such permutation can be 
realized in our model, 

To see that our definition is correct it is necessary te prove that m= 
= oP A wT € Ani, (V). Clearly it is only necessary to prove the antisymmetry 
of the form w. 

We show that under a permutation of two independent variables &; and €;4; 
the form w changes sign. It will then easily follow that a € Anas (V). Let 


TE S44 be such a permutation. We shall seo that 


eve 


TH = —& = (sen T) ow. (7.47) 
From (7.45) we get 


to = \' (sgna)-(t0) a. 
G 


We split this sum into two: 


ta = NV (sgn a) (ta) a-- S*” (sgn a) (ta) a. — (7-48) 
G Oo 


The first sim wil] contain those permutations o for which either a7 (4) < p, 
at (i+ 41) <a p or o7 (i) Sp td, o78 (6+ 4) > pt 4. For each of such 
permutations 


(ta) a = —oOa, 


To make this statement clearer, we denote k = o7' (i), b= a7 (f=: 1), ie. 
t= (k), i -+- 1 = o (I). The form oa isa product of wm? and w9, the independent 
variables of wP being the vectors Eo), Sara), . » +s Egcpy: and of wT the vectors 
So(p4-i)>° 7) So(peqy JER S pandl < p, then &, = Eocny and Spi = Eacyy are 
the independent variables of the form w? which is skew-symmetric ns stated. 
Consequently, interchanging & and &;4, makes oP and Jence oa change sign. 
The case where k > p + 1 and 1 > p -+ 1 is considered along the same fines. 

So for the first sum we have 


S) (sgn) (to)a= — SN" (sgn) aa. (7.49) 
9 0 


The second sum will contain those permutations o for which either o-! (é) <7 p, 
oF (i+ 1) Spd or ao () Spt di, at (E+ 1) Sp. We show that the 
sct of permutations {0} satisfying this condition (and of course condition (7.46) 
as well) coincides with the set of permutations of the form to, where o € {a). 
Let us turn to our two-column model. The statement will assume the following 
obvious fori. 

Tf under any rearrangement a vehicle & of the first column finds itself directly 
in front of a vehicle 2 of the second column, then we can easily find another rear- 
rangement as a result of which those vehicles would interchange placcs, the 
sequence of the other vehicles remaining unaltered. 

Thus, since sgn ta = —seno 

N’" (sgno) (ta) a= — N°” (sgn 0) (16) a=: — \’" (egna) oa! (7.540) 


\ 
Go 1 3 
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Substituting (7.49) and (7.50) in (7.48) we obtain (7.47). 
Example {. Consider two linear forms / (€) € A; (V) and ¢ (2) € A; (Y). 
The external product is the bilinear form 


i A e= S (sgn o)-of (&) g (Eo) =f (1) 8 (Be) —e (Ex) 7 Ee) 
1c 
Example 2. Let f (§) € A; (J), 
product o=f{Agisaq-+ i-form 
20> Sir ++ -s Sq 


w= YS (sgno) of (Eo) g (E1, Sor «eer EQD= 


g(x, fo. .--> Eg) £ Aq (V). The external 
whose in ependent variables we denote by 


= Ft (— 14)! f (3) g (Eo, coop Sinz» Si41> ooo €q)- 


78.1.6. Properties of external product of skew-symmetric forms. (1) An 
obvious property of external product is linearity: 
(a) if wP € Ap (V), TE Ay (VY). then for any real number 7 
(.@P) A of = oP A (A407) = 7% (WP f\ 09); 
(b) if wP € Ap (V), of € A, (V) and w%€ A, (V), then 
(of +02) A ot=of A 9+ OF A oF. 
(2) Anticommutativity. If @P € Ap (VY) and 7 € Ag (1), then 
OP A of=(—1)?9 oF A oP. 
Proof. Let 
OP \ OT=O= 0 (1, Ea, --0> Spaq)- 
It is easy to see that 
OF J OP =O (Eps, Spas. oe er Epays Exrx eeer EP) 


We show that the permutation p41, .--, Sptq: Si: + + -» &p) May be obtained’ 
from the vectors (&, . - -, Sp+q) fy means of pg successive transpositions. The- 
vector §p3; May be moved to the first place using p transpositions. Using the 
same number of transpositions we may then move to the second place the vector 
SEp+., and so on. In all, we shall move p vectors, using p transpositions each time, 
i.e. the number of all transpositions is pg. In this case anticommutativity is 
implied by the antisymmetry of the external product. 
(3) Associativity. If oP € Ap (V), OF € Ag (V), @7 € A, (V), then 


(oP A oF A oF = oP A (0% A 07). 


Proof.t Let o € } 4.9176 Consider 
Oo= Ss) (sgn 6) o[wP (Ey, re) E>) wg (Spay; osey Epsq) @* (Spigsy- acey Spaigsr)]- 
CG 
(7.51)- 
The sum (7.51) equals (oP A o%) A * if we first sum over all permutations- 
that leave unaltered the numbers p + q-+ 1, P +~qt2,....ptarr 
and satisfy condition (7.46) and then sum over all permutations that preserve 


the resu'ting order of the first p + q independent variables and the order of the- 
variables Gnso4i1 +--+ Sptq¢re 
Similarly we can obtain. oP A (of A or). 
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We show that in both cases we obtain a sum over all permutations satisfying 
the conditions 


o(ij<to(2)<...<9(p), 
a(p+i)<a(p+2)<...<0(p+g), ! 
O(ptqriy<m... <Cao(ptg+r). 


To do this we turn again to our vehicle-column model. Suppose there are 
three columns of vehicles moving along the road, p vehicles in one, g in another, 
and rin the third. One of the ways to rearrange the three columns into one is to 
first merce the first and the second column and then join the resulting column to 
the third. Alternatively, the second and third columns may first be merged and 
the first joined to them. Obviously the permutation o resulting from cither of 
these rearrangenients satisfies condition (7.52) and conversely cither permutation 
satisfying condition (7.52) may be obtained using cither the first or the second 
way of rearranging the vehicles, This just means that (w? A 2%) Aw? and 
w? A (wt A or) coincide. 

the associativity of external product enables one to consider any finite 
product 


@, A Or A... A Om, Where OE Ap, (V). 


(7.52) 


Exampe 1. Let ay (§), @2 (&), - «1 @m (&) be linear forms. Then 


ay Nas f\... A am= » (Sgn 6) o fa; (E,) a2 (Es) «2. @m (Sm), (7.83) 
G 


where the summation is taken over all permutations o € \) n- 


The equation is easy to verify by induction. Notice that if we introduce the 
matrix {a,; (€;)}, then equation (7.53) may he rearranged in the form 


(aq, \ az -e- A am) (Si Soe cece Em) = det {aj (Sy)). (7.54) 


78.1.7. Basis in space of skew-symmetric forms, Choose some basis (cites 
in a space V and denote by (c)P_, the conjugate hasis in L(V). Recall that 


e' (&) is a Jinea. form which on the elements of the basis {e,} takes the value 


e () = O17. 
n Section 78.1.3 we showed that all possible products 


~ ta ye 4 

eft (51) c*2 (Ba) 2. & ? (Ep) 
form a basis in Lp (V). Since Ap (VY) C Lp (V), every skew-symmetric p-form 
may he decomposed uniquely as a linear combination of those products. How- 
ever, they form no basis in Ay (V), since they are not skew-symmetric p-forms, 
i.e. do not belong to A p (V). Nevertheless we can construct from them a basis in 
Ay (V) using external multiplication. 

Theorem 7.11. Let {e,}P., be a basis ina space V, and let {ef}Piy be the con- 
jugate basis In L(V). Any skew-summetric p-form @ € Ap (V) may be represented, 
uniquely, as 


, { “~ 
o= = Onis Apt Ae tA... Ae? (7.59) 
Shy. ipsa p 


Each term on the right stde of (7.55) {* a produet of a constant OF Teed p bya 
tker-symmetrie p-form ef A eft Ae» A ep, 
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Proof. By virtue of the results of Sec. 78.1.4 we may write 


n nr 
_ WV ~~ toi t 
= me “ee — isis. . ipe te? coe & P, (7.56) 
1— 


where the numbers ity. i, = 2 (eit, ef2,..., e' P) are determined uniquely. 
Since the form  (%, £5, ..., Ep) is skew-symmetric, for any permutation 


c€)'p 
© (§(1)2 Sole): -++1 Satp)) = (sgn c) w (Ey, Eo, ».-2 Sp): 
Hence 


Pi cptata) esetg¢p) = (sgn 9) Oite.. ty? (7.57) 


Group together the terms in (7.56) that differ in permutation of the indices 
ty, ig, . . -» fp and use equation (7.57). We get 


e'0(1) wes 210 P) 


i a) » O; ae | 
iy<to<.. -<ipn Go o(1) ot Pp) 


i i 
= s Oia. ipl (sgna)e 2) .,, 6 OPI), (7.58) 
ycie<.. -<ipn 0 , 
_ By virtue of the example in Sec. 75.1.6 the sum in square brackets is 
et A et A... Ae*?. Thefproof of the theorem is complete. 

Corollary 1. Theelementse" \e*® A... AN CPUS <ig<... in <n) 
form a basis in Ap (V). That basis is empty for p > n and consists of one element if 
p=n. 

Corollary 2. The dimensionality of the space A (") is CR. 

In what follows we shall assume as a rule that the chosen basis ¢,, ¢2, . - -» €n 


is fixed and designate the linear forms ef (&) as e? (&) = &*. Then any form 
w € Ay (V) becomes 


(51, Sa. s+) Sp)= > CO Aue. A EbP. (7.59) 


om 


=<. ae <ip 
Example 1. 


EL A E2= (el Ae?) (Ey, Es)= S1 (sgn.o) a fet (By) e? (Ea)] = ; 
94 


=e! (§1) €2 (So) — et (5p) e? (E:) = ELE3 — EXE}, 
where gd is the jth coefficient in the expansion of a vector —,; with respect to the 
basis {e;}. « 
Example 2. 
ELA Ez A... A Er=det (E7}, 
ns, 
where &; = >: S{ey- poe ‘ 


~— 


j= 
75.2. Differential Forms . 


78.2.1. Definitions. Consider an arbitrary open domain G of an n-dimensional} 
Euclidean space £”. The points of G are designated z = (2, z3,..., 2"), y= 
= (y!, y*, ..., y™), and so on. 


14—01684 
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erates A differential form of degree p defined in Gisa function (x, =, 
«) Sp) which is a skew-symmetric p-form in Ap (E%) for each fired, r€ G. 
"The wet of all differential p-forms in G is denoted by Q, (G) = Op (6, EF), 
We shall assume that for fixed &,....€p € EM a p-form’ misa fanetion infi- 
nitely differentiable in G. Using the results of Section 78.1 we may write every 
p-form © as 


eee 


~ { 
@ = ~ Onset En A LL. AEP. (7.60) 
n<. 0. <ip P 
In what follows the vector § will be designated os dr =(dz', dx*, ..., dr”) 
and the vectors €), as d,7 == (d,2", di" + oy Ox"), We chouse as a basis in £? 
the vectors «, = {0,0,..., 1, 0,. ,' 0), where the unity is jin the kth place. 


Elements of the conjugate basis are the functions ef (t) == ce! (dr) defined by 
the equations 


ek (dz) = drh, 
Then the differential form (7,60) becomes 


(2, dyt, een Apt)= Sy g(a det A. Adz? 
ic <ip P 
Example 1. A differential 0-form is any function defined in a domain G (and, 


by virtue of our assumptions, infinitely differentiable in 6G). 
Example 2. A differentiable 1-form is 


1 
w@ (x, dz)= S) wy (z) dzk. 
h=1 
Jn particular, when n = 4, o (z, dz) == { (x) dr. A differentiable form of degree 
is also called a linear differentiable form. 
Example 3. A differentiable 2-form is 
@ (Z, dy2, d,z)= S) Wyn (z) dzt A dah. 
4<h 
By definition 
dx! A dzk=(el A ek) (dyx, dgz)=e! (dx) ek (dz) —e! (dz) eh (dz) = 
d,zt dyazh 


= i — i — 
=d,z! dgzh—d,z! dyzh = dazt dazh|' 


In particular, for, n = 2 we get 
dyzt dx? 


w (zy) dit, dez)=/ (2) |G, Sa]. 


The determinant jis equal to the area element corresponding to the vectors d,z 


and dpz. 
In the case where n= 3, denoting mw, = R, Wo, = Py, y3 = —Q, we get 


P Q R 
wos PP dzt \ dOi—Qdz A dA+-Rdz' fA dz? = }dyz! dyz* dyx5]. 
dyx' daz? dyx 
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Example 4. A differential 3-form in three-dimensional space is 
d,z1 d,z* a,x 
@ (x, d,z, dax, dgx) =f (x) dz! ( dz? A dz?=f (x)| doa! dox® daz? |. 
daz dx? az 
The determinant is equal to the volume element corresponding to the vectors 
a3, dor, doz. 


78.2.2. The exterior differential. 


Definition. The exterior differential of a p-linear differential form @ E Qy (G) 
is a form dw € Qp+1 (G) defined by the relation 


do= ») dWis.. i A dzit A... Adz'?, 
ty< 0. <dyy 
where 
2 do 
q1.. tp 
dos, in py axk dxk 
k=1 
Thus, if 
* 2 
Q= > Oi. ap ae A eee A az P 
41< -<tn 
then 


° 001, eee in ; ‘ip 
da= > S) —Zar ath A dzit \ ... A dz . 
h=1 i<...<ip 
Example 1. The differential of a form of zero degree (i.e. of the function — 
f (z)) has the form 


of 
df (x)= 5 Zan at. 
k= 1 
Example 2. Evaluate the differential of the linear form 
ri 


O= (z, dz) = S) w; (z) dzt. 


— 
— 


We get 
non 
OW; (x 
dw=do (xz, dyzZ, tat)= Dd, 2 “er {2} dzk f dzt. 
Since dzk A dct = —dz* A dzk and drk A drk = 0, we have 
dM; i 0M; 
do = Dy oe C2" A az +2 sot dak \ dat = 
h<i 5 
= >) Oe I01 dzk , dzt— Pim —* aah A dzi= 
k<t 
OO; —t <) h . 
=), ( er Oak det A dat. 
k<i 


14* 
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In particular, when n = 2 we get for o = P dr + @ ex? 


— (22 oP 2 s 
d= (<= =) az ‘\ dz-. 


78.2.3. Properties of exterior differential. The following properties follow 
immediately from the definition 

(4) if w, EQ, (G), M2 € Np (G) then d (w; + @.) = dw, + ds, 

(2) if w € Gp (G) and 7 isa real number, then ¢@ (2.0) = 2. do, 

(3) if m, EO, (G), w, € QO (G), then 

d(o, A O-)=—do, (A ©.+(—1)P ©, A doz. 

We prove property (3). Let 

Sey. a det A ee Ade? 
W< eee <ty, 
We introduce the following notation: 


O= Lee. 


dM; 


dw i1...dp i 4 
= Se ax 1 eevee = P 


i<...<ip 
Then dw may be written as 
- 7 
do = >) dk No. 
h=1 
Remember that 
— O= 0, A O,=(—1)PTo, A oO}. 
Further 


G0 GW, r J), O00, ; OW. 
agh aah NO + Or AN sar aoe NN O27 (— Pd a A Oy 
Then 
re 7 n , 
GO) 0G) 
— hk —— } 1 - 

da = Dy dt A, Dy atk AGH A @2-+ 

h=1 =f 

n ~ 
dW. 

~t(— 1)Pa » dzh \\ ah A O1 = do, A @2+(—1)Pl dos A wy. 


k=] 
Since dw. is a (q+ 1)-form, we have 
dos (\ Oy =(—1)P4) @, A dis. 
Hence dw = dw, A @. + (—1)P ay (ats. 
The following important property offthe differential is true. 
The baste property of the exterior differential: 
d (dw) = 0. 
Proof. Suppose first that w is a form of deerce 0), i.e. © (zr) = f (2). Then 
nN re ce ary 
d(dfy=a 2 —p ae = oo at al 
jx 


ark art 
netic, Oto 
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Since dz* A dzt = —dzt A dzk, this equation may be rearranged in the form 


aan=> (<p —) att nat, 
i<kh 


dzt Axk = Ack Oat 


hence d (df) = 0. 


Now let 
{ 
o= pa Oi. ip Pt tN vee Adz p, 
i<..-<in 
Then 


TN 
do= SS doy Adit A... Ade? 
ho=1 43<...<ip 
Notice that each term of the sum is the external product offthe differentials 
of 0-forms, i.e. of forms Wj,...i) (2), e!* (dz), -. -, e*? (dz). It remains to apply 
property (3) and take advantage of the fact that for the form of degree 0 the 
basic property has been proved. 


78.3. Differentiable Mappings 


78.3.1. Definition of differentiable mappings. Consider an arbitrary m-di- 
mensional domain D of a Euclidean space E™ and an n-dimensional domain 
Gc E®, Points of D are designated ¢ = (2), ¢?, ..., 2) and those of G are de- 
signated z = (21, x7, ..., 2%), 

We shall say that m maps D into G if 

p= {P,P - ++ OM) 
where gh (¢) are defined in D and vectors x with coordinates xh = qb (t) lie 
in G. 
We define a mapping 9* transforming Q, (G) into Q, (D) for any p,O< pe 
<n. We shall assume that each component g* (t) of a mapping @q is infinitely 
differentiable. 

Definition. Let @ be a mapping of DC E™ into Gc EN", We denote by 9* 
the mapping which for allO < p < n acts from Qy (G) into Q, (D) by the following 
rule: if 


i 
o= DS yap tt A Ade? 


t41<...<ip 
then 
OF (o)= Dy, OD) O* G2) A... A ot (az*?), 
ii< eee <ip 
where 


™ 
- yi 2e 
or (dxt) = Oth, dtk, 
k=1 
Example 1. Let be a form of degree 0, i.e, w = f (x). Then 


9* f) =7@ @)). 


Example 2. Let p map an n-dimensional domain D c £7 into an n-dimension- 
al domain G c E", and let w be the following n-form: 


@ = dz A dz? ... A dz. 
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Then 
- og? - d 
q h pn k 
hd () = { —— dl \} eee ( Tain ) = 
4 ( ) > ou rN A » arin 
ky k=l 
- aq a 
— fy q h t _ 
=) > ah at A... Adt t= 
fkyeth,=t 
0 agt 
— 1 ee ee 
=dt! A... A din » (sen c) moth) "°° Gatny 


ag 
=dliiA ... A ditdet {—-}. 
A A ae] 


Thus 


D (pl, pt, ..., PY) ” 
* 1 2 ny — Se et te gd 2 fi 
p* (dz1 A dz? A... (A dz") Di, 1, 7, ditt A dt? fA... A dt". 
Remark. The form g* (q@) is a differential form obtained from a form @ by 
changing variables 
78.3.2. Properties of mapping ¢p*. The following properties of the mapping 
* are true: 
(1) If m, € Qy (G), Wg € Qg (G), then 


p* (@, A Ms) = g* (1) A p* (9). 
Proof. Let 


i 
O= SY aga, att A. Aer? 
i9<...<ip 
_ . } h 
Or= Nba ie (dBA Adz? 
hiss. echy q 
Then 
. 
QO, \ O.= s" > Cedi (z) Phi, akg (z) x 


i 
Xd A... dr’? Adz? A... Adz? 
and hence 


* (o, A @2)= MN a; (ep (0) On (4 (0) 9° (dz!) \... Age (dr 2) = 
ieok 


= NV ay g) gt (aztty Ao. A ot de?) ALS bn (a) gt dz") A 
i hk 
--+ AG? (dz) == * (3) A p* (@;)- 
(2) If o€ Qp (G), then 

P* (dw) =dq* (w). 
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Proof. We first prove this equation for p = 0, i.e. for w = f (xz). We get 
n 


7] 
do = » f dz', @* (o)=f (9 (t)), 
Oz 
i=1 
™ 3 ™ n af aot 
— — P _ 
dp*(a)= Di ari @M)dth= >) Dae a= 
= of p* (dz!) = o* (dw). 
azt 


For an arbitrary p, we carry out a proof by induction. Let Of... p (z) dz*t A 
A... dz’? Then do= Uftynip A dz? A... A dz*?. By property (1) 
and the just proved relation 

i 
p* (do) = 9* (df) A 9* (dz"t) A... A ot (dz ?). 
On the other hand, 
dep* (o) = dq* ((fdzt A... A dz'P-t) A dz‘? |= 
=d[p* (f dz A... A dz'?-1) A o* (dz'P)). 
Further, by property (3) of the exterior differential 


dp* (a) =do* (f dz¥f A... Adz*?-2) A pt (dz?) + 
4 (—4)P-1 p* (f dtl A... A dz’ P=1) A do* (dz*?), 


Notice that m* (dz'?) = do* (z'?) by what has just been proved, and then 
by a basic property of the exterior differential dp* (dz*?) = 0. 

Under the induction hypothesis true for p — 1 

dot (f dz A... A dz*P-1) =o" (df A dzt A ... A dz'P-2). 


As a result we get 


de* (wo) =9* (af A dztt A... Adz?) A g* (dz) 
and by property (1) 


dp* (wo) = @* (df A dz!f A... Adz’). 

The next important property is transittvity. 

(3) Consider open domains Uc E!, VO E™, Wa E® whose points are 
u= (ui, u*, ee og U ); v= (v, v*, ce ey ym), w= (w', w*, oe 0 ey wt), respectively. 
Let p map U~ V and let y map V — W. By xp oo denote a mapping, called 
composition, which acts by the rule 


(tp og) (uz) =  [— (u)). 


Similarly we introduce the composition m* o%* which for any p transforms 
Q, (7) into Qy (U), i.e. 


(p* oxp*) (wo) = o* [y* (o)). 


/ domain Dc 


. 216 . 


The following Cquation is true: . 
(Pog) = oF oye, 


Proof. We denote p — og. This means that B = (Bl, pr 


BA 5 ye (7, q,.. +» M%), oe Br), When 
We first construct a pr = ae 
peer ge linear form dwk & Q, (W): We get 
; . A ge . Pe 
p* (duh) = dpe (wh) — dBA (u) ropa: - dYA doi 
cs = —— du! = — 9— 
a dul 2 2 dv} aul dué 
ms imi juni we ‘ 
Next : 


(OP oY") (Aah) = 9° [p* (om)] = 9* [ay* (wh)] — G* (aH) = 
™ ii ; 
= @~* ayk ey Oy 
9 (> ae dy3 ) =2 rer p* (dv). 
But | 


t @ 
O° (dv) = dp* (vi) dpi = S\ 2%" ays, 
ium{ 


Gu 
and then 
m 1 ee 
aq? 
* 0 wb*) (dh) = i i dut 
(p* 0 1p*) (dwh) am rr 


and thus the equation is proved. It follows that property (3) is true for any 
linear form. The proof proceeds by induction. Let 


w=f (w) dw't;\ ... A dw’? Qn (MW). 
Then 
B* (wo) = B*(f dw! A... A dw'P-) \ Be (dw'?) = 
=(p* eH) (fd! A... Afdw'P-1) A (g* 0 H*) (dw'?) = 
= (9% 0 Wp) (fdwo!t A... A dw'P) =(q* ow*) («). 


78.4. Integration of Differential Forms 

78.4.1. Definitions. Denote by J™ a unit cube in a Euclidean space 5”: 
m= (€E™ Oct<i,i=1, 2,..., m). | 
By a mapping 9 of a cube J™ onto an n-dimensional domain G c &* we shall 


es «os } by 
ing onto G of some domain D c E™ containing J™. Similarly 
y oe ceaiial dora @ defined in J™ we shall mean a p-form defined in 
fm containing J™, 
Definition 1. The integral of a p-form 


o = f(t) dt A d® A... A ate 
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IP over IP is 
defined in — ; 
[ o= \ uae f(t) dti dt? ... dep, 
yp 0 0 
Our immediate goal is to define the integral of a differential form over any 


Definition 2, Let us mean by an m-dimensional singular cube in a space 
B® (m < n) a differentiable mapping of a cube J into E®, Denoting a singular 
cube by C we may thus write 


C= 9: J™ + EN, 


We shall say that a singular cube C is contained in Gc E*ifpuUm™ec G. 

Now we can define the Integral of any p-form w € 25 (G) over any p-di- 
mensional singular cube C c @. 

Definition 3. The integral of a form oc Qy, (G) over a singular cube C = 
= 9: IP E® contained in G is 


\ o= | q* (o). 
Cc P 


mean by a surface only the image of a- cube. . 


We shall show that the integral of a p-form over a p-dimensional singular 
C depends only on the image @p (JP) rather than on the law of correspon- 


We first ider j : me : 
diecalare the €r im more detail the definition of the integral of w over a 


_ Leow 2p (G) be of the form @ = F(z) dz A... A dz’? then ~* (w) = 
=f [y (?)) p* (dz A -++ A dz), By virtue of Example 2 in Section 78.3.4 
| 


d an 4 
Po)=7 (9 ee Ton ee ae dt! Adi? A... A aup, 
Hence Pn Og wang ) 


1 1 
Jo~ frei Bete ce 2) aa ses AaB, 
rp 


Definition 4 : 
Cubes. Wy, + Let Cg: IP +» EN and C, = oy: IP + En be ¢ i 
ttsety such atl say that C, = Cy tf there is a one-to-one mapping t of ecube THe 


(1) Dee et Oh 
(: ’ 3 o 8 0 
7 D (1, = Pee, my >O. 


Xt ig ¢} 
“Atlety Car that if ¢ = C,, th —_— ff o . 
leg VE shalt gPPTOPTiate requirements, “1 Ce the inverse m 
= pe (obvi Gt - —¢; in condition (2) the J acobian js eve | 
2 differ in orientation, 2) It is sometimes said in this case thatee 
tr 
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The following statement is true: if C; = C., then 


fon fo. 


Cy Ce 
Proof. We give a proof for the case where 
w = f(r) dz! A dz? f\... A dzP. 


By defhnition 


Y b, Ts eee) GE 
\ QO = | fs) pag det A. A ate, 
Co rP 


As stated there is a mapping t of a cube 1P onto itself satisfying conditions 
(4) and (2). 
We make in the integral a change of the variable ¢ = t (s), s€ TP. We get 
G2 (t) = po {t ()] = G1 (5), 
D (qh, ph, ..., PP) D(t?, t?, ..., TP) 


J o= J ste) Gg ee a 
Co jP 


dstAds?h(...Ads?= 


Dis), 2.0, SP) 


p 
= | f (7, (s)} Pay 2221 93) 3 A see A ast = | o. 
1P Cy 


Similarly we could show that if C, = —C,., then 


{ o=—|o. 
Cy Ce 
78.4.2. Differentiable chains. We shall need surfaces that split into several 
ieces, each being the image of some m-dimensional cube. The two circles of the 
oundary of a two-dimensional ring may serve as an example of such a surface. 
We shall distinguish between the orientations of these circles. It appears very 
useful in this connection to introduce Jinear combinations of singular cubes wit 
real coefficients. 
Definition 1. A p-dimensional chain C is an arbitrary set 
(Ps, day seer Ane Cay Cay . oy Ch), 
where }, are real numbers and C; are p-dimensional singular cubes. We shall use 
the notation 


C = AC, + eee + Ay Che 


We shall say that C is in G if all Cy are in G. 

The set of p-dimensional chains will form a linear space if we introduce in 
a natural way the operations of addition and multiplication by real numbers. 

Definition 2. The integral of a form w over a p-dimenstonal chain C,contatned 
in G ts 


{ W = 2, ( W-+- 2. \ Ade ce of A { W. 
C Cy Cag Cy, 


Now we can define the boundary of an arbitrary singular cube. To do this 
we first define the boundary of a unit cube. 
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Definition 3. The boundary of a cube I? is a (p — 1)-dimensional chain 
Pp 
ai? = 3’ (—1 UP O-2 Oh 
i=1 


where I® (1) ts the intersection of the cube IP with the hyperplane zri=a,(«¢= 0, 1). 
For this definition to be correct, it is necessary to make clear how we interp- 
reted the statement that J® (1) is a (p — 1)-dimensional singular cube. 


We construct the canonical mapping 9 = @e'Pof a cube [P-1 on JZ (i). Let 
s = (s!, s2, ..., sP-1) € IP-1. We set 


sk if4<ck<i, 
pe o=4 a if k=i, 
skh-l ifi<k<p. 
Obviously @ = (p', m2, ..., @P) maps JP-1 in a one-to-one manner onto 
TF (i). For a = 0 and t= p in particular the mapping @ is the restriction to 
IP (p — 1) of the identity mapping of a space EP onto itself. 


Definition 4. The boundary of a p-dimensional singular cubeC = @: [IP + E™ 
is a (p — 1)-dimensional chain 


p 
ac= S) (— A [eF @)—9 U? @)I. 
i= 
The boundary of the image of JP is thus the image of the boundary of J? 
with natural orientation. ; ; 
Example 1. Consider in the plane a square J*. Obviously this square may be 
considered as a singular cube if we take as p an identity mapping. Figure 7.14 


z?] 
t? \\ 
17(1) I? (1) x! 
— \ 
13 (2) t} ZA 
Fig. 7.44 Fig. 7.42 


shows the boundary of the square. Here the direction of the arrows coincides 
with the direction of increase in the parameter f, with respect to which the 
integration is performed, if this side of the square appears in the chain dJ* with 
the plus sign, and is opposite if the side of the square is taken with the minus 
sign. We see that our convention about signs leads to the usual counterclockwise 
tracing of the boundary. 

Example 2. Consider the singular cube C = p: J2-> R*, where p has the 
orm 


gt = (a - Rt*) cos Anz, 
gq? = (a + Ri) sin uz", 
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Ht is easy to see that @ (/*) is a ring whose boundary is formed by circles cf 
radii a and a + R, We shall show what the boundary of a singular cube C is. 
Obviously @ (/2 (1)) is the circle 

¢? = acos 2x¢*, 

q* = asin 220, 

Further, q (77 (1)) is a circle of radius a + &. Finally, p (7% (2)) and g (Jj (2)) 
is a segment z27>=O0,aca2 cat R. 

In Fig. 7.42 the arrows indicate the direction of circulation about the bound- 
ary 0C if the boundary d@J/? is traced counterclockwise. 

Since g (7% (2)) — gp (7? (2)) = 0, we may assume that 

aC =O (TF (1) — ¢ (75 (4), 
which coincides with the usual interpretation of the boundary of a ring. 

We shall see in what way the integral of a form w along the boundary of a 
cube C and that of a form t, (o) along the boundary of J? are related. 

5) 


Statement. Let C = ~: 1/P + E® bean arbitrary singular cube contained in G 
and lef w€ Qp., (G). We have 


w) = (p* (w). 
éc ayP 
Proof. Obviously, by the definition of the integral over a chain it suffices to 
prove the equation 
\ o= | (p* (@). 
@ (18 (4) HP (t) 


Consider the canonical mapping 9 = 9%"? : Jp-) + 72 (i). By definition 


| ot@= | ator (or. 
IP (4) rp“ 
By property (3) of differentiable mappings (see Section 78.3.2) 
pt o@t = (po gt. 
Thus 
[ qt (@)= ) (po g)t (0) = | o= [ ©, 


a 


IP) yp-t (gop) (PAH) gd) 
since (po q) ([P-1) = @ (72 (i). 


7.9.4.3. The Stokes formula. 


Sain theorem, Let C = qi: TP — E* be an arbitrary singular cube contatned 
in G and let w € Qyy (G). We have the Stokes formula 


| do = lo. 


e e 


Cc oc 
We prove the Stokes formula first for the following special case. 
Let w be a differential form of degree p ~ 1 defined in J?. Then 


( do = [ w. (7.61) 
IP orP 
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Proof. Let w= /f(t)df® A... A dt?. By definition! 


Foe Sem( fo f of, 


aiP i=i IP (i) IP(i) 
Evaluate the following integral: 
\ w, where i=j4, 2, ..., Pp, @=0, 1. 
Fd) 


Consider the canonical mapping 9: [p-1_, J? (i). By virtue of the results 
of Section 75.4.4 


fo= | rem Eh ast q... Nasr, 


WP) PO 
By the definition of the canonical mapping pyr the Jacobian has the form 
_ D(s?, ..., st, a, sf, ..., SP) 
7= DG en RY 
and 


f= D (s!, s2, ..., sP74) 


D(st, 88, sey SP) , 
Thus only the integrals over J? (1) may be nonzero and we get 
\ o=(—1)( | o— \ o)= 
orP 1P(4) IP (4) 
= \ 7 (4, st, s?, ..., sP-) ds! A... A dsP71— 

ye} 


— \ { (0, si, eeey sP-1) ds? A ese A dsp-?, 
ppm 


By the definition of the integral over the cube /P-1 


| 4 
\ = eee | [f (4, st, ...,s?71)—f (0, s1, ..., sP-)] ds? ds? .., dsP-I= 
6jP 0 0 . 
i 4 | af af 
= | \ oan | gor dst dst.., dsp | SE aco ... A aor, 


On the other hand, 


dw = Zt ae A dt? A... A dtP. 
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Therefore 
« ~ 4 
do = | Seat A... A ae’. 
1? yP 


Thus equation (7.61) is proved. 
Proof of the Stokes theorem. By the definition of the integral over the sin- 
gular cube 


[ deo = | (dw). 
C 1P 
By property (2) of differentiable mappings (see Section 7S.3.2) 
| @* (do) = \ dq" (ov). 
7P 1P 
Next we use the already proved Stokes formula for the cube JP 
fast wo J oto) 
1P oyP 


It remains to notice that by the property of the integrals along the boundary 
of a singular cube (see this at the end of Section 78.4.2) 


¢* (a) = [o. 
oP ac 
This completes the proof of the theorem. 
78.4.4, Examples. (1) Consider the case p = 1. A one-dimensional singular 


cube C in £" is some curve whose end points are denoted by a and b, The Stokes 
formula takes the form 


fag = (f=f(0)—f(@). 
C 5C 

In particular, when n = 1 we obtain the Newton-Leibniz formula 
b 


\f' (2) dz= f ()—f (a). 


(2) Now let p = 2. A two-dimensional singular cube C is a two-dimensional 
surface, and a form w € Q, is 


n 
h 


a | 


Using Example 2 of Section 75.2.2 we get 


| » (2-7) iz" A dz‘ = { 3 oy dz". 


C het uC hel 
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If n = 2, then, denoting wm = P dz! 4- Q dz*, we obtain the Green formula 


(ras dx! \ dvi= | P das + Q dz. 
éc 


If n = 3, we obtain the ordinary Stokes formula. 
(3) Let p =n. Then @ € Q,., has the form 


n 


a= Xs wo, dz! \ ... A dat! A da'tt A... A da. 


Further 


ond 52en dz! \ da! \....\ dat= 


k=l i=1 
n 3 , 
= >) (— 1) da! A da? A... Ada. 
k= { 


In particular, for n = 3 
o=P dz “A ae aa A do 4Rdx\ A dz, 


om (et 4 RR) dat aad NO 


and we obtain the Ostrogradsky formula. 


‘CHAPTER 8 


THE LEBESGUE INTEGRAL AND MEASURE 


In Chapter 10 of Volume 1 and in Chapter 2 of this volume we 
studied the Riemann integral of a function of ono and rn variables 
respectively. The concept of Riemann integral covered a_ class 
of functions that are either strictly continuous in a domain under 
consideration or close to continuous functions (the sot of whose 
points of discontinuity has an n-dimensional volume equal to zero). 
This notion turns out to be insufficient in a number of fundamental 
branches of modern mathematics (in the theory of generalized func- 
tions, in the modern theory of partial differential equations and 
SO On). 

In this chapter we present a theory of a more: general integral, 
the so-called Lebesgue integral*, for which purpose we develop as 
a preliminary a theory of measure and the so-called measurable 
functions (which are a broad generalization of continuous functions). 

The basic idea of the Lebesgue integral distinguishing it from 
the Riemann integral is that in composing a Lebesgue integral 
sum the points are combined into individual terms not by the prin- 
ciple of their closeness in the domain of integration (as was in the 
Riemann integral sum) but by the principle of the closeness of the 
values of the integrablo function at those points. It is this idea 
that allows the notion of integral to be extended to include a very 
wide class of functions. 

It is to be noted that many mathematical theories allowing inter- 
pretation of the integral in the Riemann sense assume a more com- 
plete character ~when the Lebesgue integral is used. An oxainple 
of such a theory is the Fourier scries theory presented using the 
integral in the Riemann sonso in Chapter 10 and the Lebesgue integral 
in Chapter 11. 

Throughout this chapter the presentation is made for the case 
of a single variable but can be oxtended without difficulty to any 
number n of variables (an appropriate romark is made at the end 
of the chapter). 


* Henri Lebesgue (1875-1941), a French mathematician. 
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8.4. ON THE STRUCTURE OF OPEN AND CLOSED SETS 


We shall consider an arbitrary set £ of points of an infinile straight 
line (—oo, ©). 

The complement of £ is the set designated CZ and equal to the 
collection of those points of the infinite straight line (—oco, oo) 
which are not in £. 

If we regard as the difference of sets A and B the collection of 
those points of A which are not in #2 and denote the difference of 
A and B by A \_ B, then the complement C£ of / may be repres- 
ented as 


CE = (—oo, o)\ EL. 


We recall some definitions introduced in Volume 1. 

1°. A point x is said to be an interior point of a set EF if there is 
some neighbourhood of x (i.e. an interval containing that point) 
lying entirely in £. 

In what follows an arbitrary neighbourhood of x will be designated 
v (x). 

2) A point z is said to be a limit point of / ifin any neighbourhood 
v (x) of x there is at least one point z' of F distinct from zx. 

3°. Aset G is said to be open if all the points of that set are interior. 

4°. A set F is said to be closed if it contains all of its limit points*. 

Let us agree to denote the collection of all limit points of an arbi- 
trary set Z by E' and the sum or union of two sets A and Bhy A+B 
or A{j B**. Let us also agree to give the name of closure of an 
arbitrary set & to a set designated # and equal to the sum Z + E’. 

It is obvious that for any closed set Ff we have IF = F. 

The collection of all interior points of an arbitrary set £ will 
be designated int L***. 

It is obvious that for any open set G we have intG = G. 

For an entirely arbitrary set # the set int is open and # is 
closed. 

Remark. One can show that int /& is the sum of all open sets 
contained in & and that F is the intersection**** of all closed sets 
containing Z. Thus int £ is the largest open set contained in E 


and £ is the smallest closed set containing £. 
We shall discuss the simplest properties of open and closed sets. 
1°. If a set F is closed, then its complement CF is open. 


* In particular, a set having no limit points is closed (for an empty set is 
contained in any set). 
** The sum or union of sets A and B is a set C consisting of points lying in 
at Jeast one of the sets A or B. 2; 
*** int are the first three letters of the French word intérieur (interior). 
*x4+% The intersection of A and B is a sct of points lying both in A and B. 


45—01684 
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Proof. No point z of the set CF is in /, nor is it (by the closure 
of /) in the set F’ of the limit points of /. But this means that some 
neighbourhood v (z) of z is not in / and is therefore in CF. 

2°, If a set G is open, then its complement CG is closed. 

Proof. Any limit point z of the set CG is a fortiori in that set, 
for otherwise z would be in G and since G is an open set some neigh- 
bourhood v (z) of z would also be in G and would not in CG, i.e. z 
would not be a limit point of CG. 

3°. The sum of any number of open sets is an open sel. 

Proof. Let a set & be the sum of any number of open sets G, 
(the subscript a is not an integer in general) and Jet z be an arbitrary 
point of £. Then (by the definition of a sum of sets) z is in at least 
one of the sets G, and since every set G, is open, there is some neigh- 
hourhood v (z) of x also lying in the set G, and hence in the set E. 

4°, The intersection of any finite number of open sets is an open set. 

Proof. Let a set E be the intersection of open sets G,. G., ...- 
..+, G, and let x be any point of Z. Then for any & (A = 1, 2,... 

. ., 2) the point z is in G;, and therefore there is some neighbourhood 
Vy, (x) = (@ — Ep, T+ &,), &, > O, of x also lying in G,. We = 
= min {€,, €., -.-, €n}, then the neighbourhood v (z) = (x —e, 
x +e) of x is in all G, and consequently in Z£. 

a. The intersection of any number of closed sets is a closed set. 

Proof. Let a set / be the intersection of any number of closed 
sets I’, (the subscript @ is not an integer in general). Notice that 
the complement CE is the sum of all complements CF, each of 
which is, according to 1°, an open set. 

According to 3° the set CZ is open and therefore in view of 2° the 
set £ is closed. 

6°. The sum of a finite number of closed sets is a closed set. 

Proof. Let £ be the sum of closed sets F,, Fa, .... Fp. Then 
CE is the intersection of the sets CF,, CF., ..., CF,, cach of 
which is by 1° an open set. According to 4° the set CE is open and 
therefore in view of 2° the set £ is closed. 

7°. If a set F is closed and a set G is open, then the set F ~\ G is 
closed and the set GN F is open. 

Proof. It suffices to notice that / \ G is the intersection of the 
closed sets / and CG and G \ F is the intersection of the open sets 
G and CF. 

sing the above properties we prove a theorem on the structure 
of an arbitrary open set of points of an infinite straight Jine. 

Let us agree throughout the remainder of this chapter to applu the 
term interval to any connected open set (not necessarily bounded) of 
points of an infinite straight line. In other words, an interval is either 
an open interval a << z < 0 or one of the open half-linesa@ << x < 0 
or —oo <¢ z <b or an entire infinite straight line —coo << z <i oo. 
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Theorem 8.1. Any open set of points of an infinite straight line 
is a sum of a finite or countable* number of mutually disjoint intervals. 

Proof. Let G be an open set and let x be an arbitrary fixed point 
of G. Since G is open. there is some neighbourhood v (z) of x con- 
tained in G. The sum of all neighbourhoods v(x) of the given fixed 
point z that are contained in G will be designated J (zx). We prove 
that / (x) is an interval. 

Denote by a the infimum of the set of all points of J (z) (in case 
the set of all points of / (z) is not bounded below, we set a== —oo) 
and by & the supremum of the set of all points of J (x) (in case the 
set of all points of J (x) is not bounded above, we set b = oo), It 
is sufficient to prove that an arbitrary point y of the interval (a, b) 
isin J (x). Let y be an arbitrary point of (a, b). To be specific assume 
that a << y <2) (the case z << y <b is considered quite similarly). 
By the definition of infimum there is a point y' in J (x) such 
thata < y’ < y. But this means that there is some neighbourhood 
v (x) of the point x we have fixed containing the point y’. By virtue 
of the inequality y’ << y <2 that same neighbourhood v (x) con- 
tains the point y. It follows that J (x) too contains y and that the 
proof that J (z) is an interval] is complete. One may say that J (z) 
is the largest interval containing the point zx and contained in G. 

We now show that if interval J (7,) and J (x.) are constructed 
for two distinct fixed points z, and z, of G then those intervals 
either have no points in common or coincide. Indeed, if J (z,) and 
I (xe) contained a common point xz, then they would be both con- 
tained in J (z) and would therefore coincide. 

Having constructed for each of the points x an interval J (z) 
we now select the intervals containing no points in common (i.e. the 
ones that are mutually disjoint). Such intervals each contained at 
least one rational point (we know this from Chapter 2 of [41]). Since 
the set of all rational points is countable (see Section 3.4.6 of [4]), 
the number of all mutually disjoint intervals J (x) is at most count- 
able. Since the sum of all such intervals is a set G, the theorem is 
proved. 

Corollary. Any closed set of points of an injinite straight line is : 
obtained by removing from the line a finite or countable number of mu- 
tually disjoint intervals. 


8.2. MEASURABLE SETS 


8.2.14. The outer measure of a Set and its properties. The entire 
theory presented in this section is due to H. Lebesgue. Its starting 
point is using as a primary (original) set the interval A = (a, d) 

* Recall that a countable set is an infinite set whose elements can be enumer- 


ated, i.e. brought to correspond in a one-to-one manner to the natural number 
series 1, 2, 3, ... (see Section 3.4.6 in {1]). 


15 * 
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whose Jenglh or measure is assumed to be known and equal to 
a number |Af{=&—a>0. 
Let £ be an arbitrary set on the number line. A corcring S = 
= § (/) of FE is any finite or countable system of intervals {.,} 
whose sum contains the set £. The sum of the lengths of all the inter- 
vals {A,} constituting a covering S$ = S (£) is designated oa (S). 
So 


o(S)= » [Au]<o. 
nr é 


Definition. The outer measure of a set FE is the infimum of o (5S) 
on the set of all coverings S = S (FE) of EF. 

The onter measure of a set £ will be designated | £ |*. So by 
definition 


[i {* = inf a(S). 
S(E) 


Obviously the outer measure of any interval coincides with its length. 
We now show the basic propertics of the outer measure. 
1°. If a set E, is contained in E%, then | E, \* <1 E. |. 
To prove this it suffices to notice thal any covering of L, is at 
the same time a covering of £;. 
2°. If E isthe sum of a finite or countable number of sets {F} (in 


symbols If = (§ E,), then 


I= 


[EI"< BIE, A. (8.1) 
k=f 


Proof. Fix an arbitrary ¢ > 0. By the definition of the measure 
1 £, |* as the infimum, for every /& there is a covering S$, (E}) of 


a set £, by a system of intervals {Ak} (nm = 1, 2, ...) such that 
x [An <IEr |" + op. (8.2) 


Denote by S the covering of the whole of £ combining all the 
coverings Sh. (is 41, 2, ...) and consisting of all the intervals 
{A} heed. 2, 0003 n=1, 2. ...). Since S$ js a covering of £, 


we have {Z|*<o(S), but g(S)= x x [A*]. 
hoo n-={ 


* In symbols the fact that a set Ly is contained in E. is designated as fol- 
Iews: Fy co Ea. 


Ch. 8. The Lebesgue Integral and Measure 229 


From the last two relations and from (8.2) we get 


|E\"<D) (1Enl* +57) =D! 1Enl? +e. 

k=1 k=1 
Thus inequality (8.1) is proved. 

Let us agree to regard as the distance between sets #, and E, 
the infimum of the distances between two points of F, and £, re- 
spectively. 

We shall denote the distance between £, and Ey, by p (Ey, £,)- 

3°. If p (Ey, E,) >0, then | £,U £, [* = | #, (* + [ £, IF. 

Proof. Set 6 = (1/2) p(£,, E,). For an arbitrary ¢ > 0 and a chosen 
5 > 0 there is a covering S (£) of E = E,\) E, such that o (S) < 
< | |* + and the length of each interval of the covering | A, | 
is less than 6*. It is obvious that the intervals A, covering the points 
of E, contain no points of &, and conversely the intervals covering 
the points of Z, contain no points of £,. In other words, the cover- 
ing S (£) falls into the sum of two coverings S (Z) = S, (£,) + 
+ §, (E£,), the first, S,, covering £, and the second, S,, cover- 
ing Z,. So we get . 

S, (Ey) +S» (£2) [E|’ +e. 

It follows that | 2, |* + | £ 


3 


I! < | & |* +e and therefore (by 


the arbitfariness of c) | £, |* + | 2. |* < | £ |*. Since on the basis 
of property 2° the opposite inequality |Z |* < | £, |* + [| £. |* 
is also true, we have |Z |* = | £, |* + | £. |*. Thus property 


3° is proved. 

In particular property 3° is true if £, and £, are bounded, closed 
and contain no points in common. 

4°. For an arbitrary set E and an arbitrary number ¢ > 0 there 
is an open set G containing E such that |G [FF <[£ {[*¥ +e. 

Proof. It suffices to take as G the sum of all the intervals consti- 
tuting a covering S (£) of F for which o (S)< |£E |* +. 

8.2.2. Measurable sets and their properties. 

Definition 1. A set E is said to be measurable if for any positive 
number e there is an open set G containing E and such that the outer 
measure of the difference G\ E is less than e. 

The outer measure of a measurable set E will be called the measure 
of that set and designated | E |. 

It follows from this definition that the measure of a set E is zero 
if and only if the outer measure of that set is also zero. 


* This follows from the fact that given arbitrary « > Oland 6 > 0 there is 
a covering S (Z) of E such that o (S) << { £J* + € and [ A, | < 6 (for each 
A, of S). To see this it suffices to take a covering S’ for which o (S') << | E{* +: 
-++ £/2, divide each interval of S' into intervals of length smaller than 6 and 
cover the ends of these last intervals with intervals the total sum of whose 
lengths is Iess than ¢/2. , 
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We prove a number of statements revealing the basic properties 
of measurable sets. 

Theorem §.2. Any open set is measurable, its measure equalling 
the sum of the lengths of the mutually disjoint constituent intervals. 

The proof is trivial (it.suffices to take G = £ in the definition 
and notice that the infimum of o (S) is attained on the covering § 
coinciding with the subdivision of E£ into the sum of mutually dis. 
joint intervals). 

Theorem §.3. A sum of a finite or countable number of measur- 
able sets is a measurable set. 


CO 
Proof. Let E = UY EE, such that every £, being measurable. 
n==l 
Fix an arbitrary ¢ > 0. For every set /, there is an open set G, 
containing it such that 


IGnNEg]|* <e-2™. (8.3) 
On setting G = U G, we notice that the set £ is contained in 


n= 


Gand that the difference G \. E is contained in the sum U (Gan Ey). 


But then from Property 2° of the outer measure (see Section 8.2.1) 
and from inequality (8.3) we get 


IGNE|*< <N ~ IG NEn|* <eE Y 2-0 — 


bo 1 
bee — 


Thus the ‘heorem is proved. 

Theorem 8.4, Any closed set F is measurable. 

Proof. \We prove the theorem in two steps. 

4°. First suppose that the set / is bounded. Fix an arbitrary e > 0. 
According to Property 4° of the outer measure (see Section 8.2.1) 
there is an open set G containing F and such that 


|G JF] F IF +e. (8.4 
By Property 7° of Section 8.1 the sot G \ F is open. By Theorem 5.1! 
therefore G \ fF can be represented as the sum G\ F = JU Ay, 


new f 
of ‘mutually disjoint intervals A,. The theorem will be proved if 
we establish that 


~ ~ 

IGNF|* = “ jAat< (8.5) 
n=! 

For every interval A=(a, 6) and for every number @ in the 


interval ,O<a<(b—a)/2, let us agree to denote by A® the 
interval A*==(a $4, b—a) and by A® the closed interval A® = 
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=(a+a, b—«a]. If, however, «>(b—a)/2, then A® will denote 
an empty set for which |A%}=0. For every n set En= 
nr 


1 __ 
= {J Ap. It is obvious that [Zn |* =’ {AG}. According to Prop- 
k=! Rk==1 
erty 6° of Section 8.1 a set “y is closed. Since it has no points 


in common with the closed set F, we have (by Property 3° of 
the outer measure) 


[Ent P|* = |En|* + [FI (8.6) 
On the other hand, since the set Ht + F (for any @ > 0 and for 


every 7) is contained in G, we have (by Property 1° of the outer 
measure) 


lEo+FI*</G|*. (8.7) 
From (8.4), (8.6), and (8.7) we get 
[Enl*+|FIS<|Fl* +e (8.8) 


(for all @ > 0 and all n). Since J is bounded.and its ouler measure 
{| F |* << oo, from (8.8) we get 


l[EnI*<e (8.9) 


(for all a > 0 and all n). Proceeding in (8.9) to the limit, first for 
a —>0 +0 and then for rn ~ oo, we obtain inequality (8.5). This 
proves the theorem for the case of the bounded set F. 

2°. If the set F is not bounded in general, then we represent F 


[=] 
asthesum Ff = U F,, where F, is the intersection of closed sets F 


w= 
and [—n, nr]. According to what has been proved in the first step 
every /, is measurable (for it is closed and bounded) and therefore 
by Theorem 8.3 so is the set /. The proof of the theorem is complete. 
Theorem 8.5. If a set E is measurable, then so is its complement CE. 
Proof. By the definition of the measurability of #£, for any n 
there is an open set G,, containing £ for which 


IG,NEP <4, (8.10) 


Let. F, = CG,. Since CH, \ CE, = FE, \ E, for any sets] £, 
and £, (check this yourself), CE \. CG, = G, \ £ and therefore 
CEN F, =G, \ E. Jt follows from the last equation that for 
any 7 


CEN U Fy, < GaNE. (8.44) 


(Recall that the notation Z, qc £, means that £, is in £4.) 
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From (8.11) and from Property 1° of the outer measure we have 
for any nr 


ICEN U Fel? SIG ANE|* 
i=} 
and from the last inequality and from (8.10) we have 
ICEN U Fil* <= 
ki 


(for any 7). But this means that the outer measure and therefore 


co 
the measure of the set FE, = CE \ UF», equals zero, i.e. the set 
k= 


CE equals the sum of the measurable sets ly and u Fy (the last 


set is measurable by virtue of Theorems 8.4 and 8. 3), Thus the 
theorem is proved. 

Corollary. For a set £ to be measurable, it is necessary and sufficient 
that given any positive number ¢ we should be able to find a closed sel F 
contained in If such that the outer measure of the difference I N F is 
less than e. 

Proof. The measurability of the set /f is equivalent to the measur- 
ability of CE (Theorem 8.5), i.e. equivalent to the requirement that 
for any « > 0 there should be an open set G containing CE and such 
that |G \ CE |* <e. But that requirement (by virtue of the 
identity CE, \ CE, = E.~\ £,) is equivalent to the requirement 
that for any e > 0 there should be a closed set fF = CG contained 
in #& such that |E NF l*¥ = | CF NCE [> = |GN CE <e. 
Thus the corollary is proved. 

Remark 1. The measurability condition contained in the corollary 
just proved may be accepted as another definition of measurahility 
equivalent to the definition formulated at the heginning of this 
subsection. 

Theorem 8.6. The intersection of a finite or countable number of 
measurable sets is a measurable set. 

Proof. We shall denote the intersection of sets iy, Eo... 


by f) En. By virtue of the identity n E, = CI U CE,,) (check 
: = 
this identity yourself) the theorem heing proved follows immediately 


from Theorem 8.3 and 8.5. 

Theorem 8.7. The difference of two measurable sets is a measurable 
sel, 

A proof follows from the identity A \ Bs= af (CB) and from 
Theorems 8.5 and 8.6. 

We now proceed to prove the main theorem of measure theory. 
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Theorem 8.8. The measure of a sum of a finite or countable number 
of mutually disjoint measurable sets equals a sum of the measures of 
those sets. 


Proof. Let EF = U E, such that the sets £, being measurable and_ 


n= 1 
mutually disjoint. We consider separately two cases. 

(1) First suppose that all £, are bounded. Notice that for the case 
where all £, are closed and there are a finite number of them the 
theorem being proved follows immediately from Property 3° of the- 
outer measure (see Section 8.2.1). 

Now let £, be arbitrary bounded mutually disjoint sets. 

By the corollary of Theorem 8.5, for any ¢ > 0 and for every n 
there is a closed set F,, contained in £,, such that* | £2, \ F, [| e/2™ 
Since all the sets /,, are bounded, closed and mutually disjoint, for 
any finite m, by virtue of the remark above 


1U F, |= 2 IF l. (8.12). 


On the other hand, from the equation £, = (F, \ F,) U Fp it 
follows (by virtue of Property 2° of the outer measure) that | £, |< 
<|2F,™~\ Fy, | +) Fr, | <lfal + ¢/2", so that 


m m™m 
\ [Enl< ™ lFal+e (8.13): 
n= ni 


(for any finite m). From (8.12) and (8.13) we conclude that for any 
finite m 


S Ent<| U Fale. (8.14) 
n=={ n=={ 


Now we take into account the fact that the sum of al] the sets F, 
is contained in Z. It follows that for any m 


| U Fal<lEZl, 


so that (by virtue of (8.14)) for any m 


m 


* {Enl<lZl +e. (5.15). 


fo 
n= { 


Proceeding in (8.15) to the limit as m — 00, we get 
S JEn| SIE -+e 
n=] 


* Since the measurability of all the sets occurring in the proof has already 
been established we may write everywhere simply measure instead of upper- 
measure. 
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and therefore, in view of the arbitrariness of ¢ > 0, 


™~ 


~ |Enl SIE}. (8.16) 
im 


cS 

Now notice that from the equality of the sum UU £,, to the set £ 
n= f 

-and from Property 2° of the outer measure we obtain the inequality 


[EIS © En (8.17) 


that has an opposite sense. From inequalities (8.46) and (8.47) 
we obtain the statement of the theorem being proved (for the case 
‘of bounded sets £,). 

(2) Now let the sets /,, be not in general bounded. Then we denote 
by /% a bounded set Hk = F, 9 (kk —1< [2|[<h) (recall that 
stands for “intersection”). 


co oo 


From the equation EF = U U E* and from the case considered 
n=1 h=1 
Above it follows that 


| oc os oc 
[E[= NN |Ent= XS (£nl. 
={ k=} n=1 
‘This completes the proof of the theorem. 

Remark 2. The fundamental property of measure established by 
‘Theorem 8.8 is called the o-additivity of measure. 

We introduce a new concept to state yet another property of 
measure. 

Definition 2. A set E willbe saidto beaset of the type Ga, if £ can 
be represented as an intersection of a countable number of open sets Gy} 
cand a set of the type /,, if E can be represented as a sum of a count- 
cable number of closed sets Fy. 

Theorem 8.9. If a set if is measurable, then there are a set E, of 
the type F, contained in E and a set E. of the type G4 containing E 
for which | Ey |=] LIJ= Je, 

Proof. By the measurability of E and the corollary of Theorem 8.5, 
for any n there are an open set G, containing Z and a closed set Fy 
‘contained in £ such that 


JE—Fal<s-. (Gn NES. (8.18) 
Set £ =U Fn, £. = A G,. Since for any 1 
not Tes{ 


ENE, CENF,, B,NECG,ANE, 
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by (8.18) and Property 1° of the outer measure 
JEXE<+, [BNEI<+, 


By virtue of the arbitrariness of » it follows that |Z \ £, | = 0 
and |#, \ £ |=0. Thus the theorem is proved. 

Remark 3. Note that there are monmeasurable sets. To construct 
them it suffices to take into consideration that on a unit circle there 
are a countable number of mutually disjoint and congruent* sets 
whose union is equal to the set of all the points of that circle. Such 
are the set E, of all the points of a circle any two of which cannot be 
made to coincide by rotation through an angle of n-a, where n is 
any integer and «@ is a fixed irrational number, and all the sets £, 
obtained from E, by rotation through an angle of n-a. Were E> 


measurable, then so would be all the sets £,, | #, | = | Ey | for 
all integers n. But then by virtue of Theorem 8.8 we would get 2a = 
= > | E, |, which is impossible for any value of | £,, |. 

n= -— 00 


8.3. MEASURABLE FUNCTIONS 


8.3.1. The measurable function. Let us agree to mean by an ezten- 
ded number line the ordinary number line —co < x < oo comple- 
mented with two new elements, —oo and -++oo. To extend the arithme- 
tic operations to include the extended number line, let us agree to 


assume that a + (+00) = -++oo, a + (—oo) = —oo (for any fini- 
te a); (--00) + (+00) = +00, —oo + (—oo) = —on; (+00) — a= 
= +00, (—oo) — a = —oo (for any finite a), (-++00) — (—oo) = 
= +00, —oo — (+00) = —oo; a-(+00) = +o for a>Q0, 

0-(+00) = 0, a-(+00) = —oo for a<0; (+00) (+00) = +o, 
{-+00)-(—00) = —oo, (—00)-(— 00) = +00, 0-(—00) = 0, a-(—00)= 


=—oo for a >0, a-(—o0) = +00 for a <0; +00/a = (-+-0)-1/a 
for any finite a=40, a/too = 0 for any finite a. 

Only the following operations remain undefined: 

Throughout the remainder of this chapter we shall discuss func- 
tions defined on measurable sets of the ordinary number line and tak- 
ing values on the extended number line. 


—oo when r<—1, 
f (x)= 0 when —i<r<ct, 
+oo when z>1 
may serve as an example of such a function. 
~ * In this case the term “congruent” must be taken to apply to two sets one of 


which may be made to coincide with the other by rotation through some angle 
in the plane of a circle. 
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Let us agree to denote henceforth by £ {f satisfies condition A} 
a set of all values of 7 in £ for which f (z) satisfies condition 4. 

For example, £ [f > al is a set of those values of x in E for which 
f(z) >a. 

Definition. A function fj (x) defined on a measurable set E is said tv 
be measurable on that set if for any real number a the set F |f > al 
is measurable. 

Theorem 8&.10. For the function f (x) to be measurable ona set E 
if is necessary and sufficient that one of the following three sets: 


Elf>al, Elf<al, Elf<al (8.19) 


should be measurable for any real a. 
Proof. ({) From the definition of measurability of the function 
J (xz). from the elementary relations 


Ef>al= y El i>a+—], 


n= 1 
Elj>ag= fn E|{>a—-,| 


and from Theorems 8.3 and 8.6 it follows that the measurability 
(for any real a) of the set E [f >a] is a necessary and sufficient 
condition for the function 7 (x) to be measurable on the set £. 

(2) From the relation Elf<aJ=EN ElfSa) and from 
Theorems 8.3 and 8.7 it follows that the measurahility (for any real a) 
of the set F [f <a] is a necessary and sufficient condition for f (x) 
to be measurable on E£. 

(3) Finally, from the relation E(f<ajJ= EN Elf >al, from 
Theorems 8.3 and 8.7, and from what has been proved in (/) it fol- 
lows that the measurability (for any real a) of the set Elf<al 
is a necessary and sufficient condition for f (x) to be measurable 
on £. Thus the theorem is proved. 

Remark. By virtue of Theorem 8.10 the measurability (for any 
real a) of any of the three sets (8.19) may be accepted as another 
definition of the measurability of the function f (z) on a set £ equiva- 
lent to the definition formulated above. 

8.3.2. Properties of measurable functions. 

1°. If a function f(r) is measurable on a set E. then it is measurable 
on any measurable part E, of E. 

The proof follows immediately from the identity £, [f/f aJ= 
= 1, Elf sal] and from Theorem 8.6. 

2°. Ifa sel £ is a finite or countable sum of measurable sets £,, and 
if f (x) is measurable on every set Ey, then it is measurable on the set E. 

The proof follows immediately from the identity Ef aJ = 


O 
= Y FE, (fa) and from Theorem 8.3. 
Ne] 


i) 
qo 
a | 
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3°. Any function f(x) is measurable on a set E of measure zero. 

Indeed, any subset of a set of measure zero is measurable and has 
measure zero. 

Definition 1. Two functions f (x) and g (x) defined on a measurable 
set E are said to be equivalent on that set if the set E [f = g} is of 
measure zero. 

The notation f ~ g is often used to denote functions f (z) and 
g (x) that are equivalent (on a set £). 

4°. If f (x) and g(x) are equivalent on a set E and f(x) is measurable 
on E, then so ts g (x). 

Proof. Put £, = Elf~«el, Ey =~ EN Ey. Since on E, g (z) 
coincides with f (x), g (z) is measurable on E, (by virtue of Proper- 
ty 1°). According to Property 3° g (x) is measurable on Z, as well as, 
therefore, according to Property 2° g (zx) is also measurable on E. 

Definition 2. We shall say that some property A is true almost 
everywhere on a set £ if the set of points of & on which that property 
is false is of measure zero. 

Corollary of Property £.If a function f (x) is continuous almost 
everywhere on a measurable set EL, then f (x) is measurable on E. 

Proof. First notice that if f (z) is continuous on a closed set F 
then f (x) is measurable on F, for F {f > a] is closed for any real a 
and, therefore, measurable. Suppose that f (z) is continuous almost 
everywhere on an arbitrary measurable set HE and denote by Ra 
subset of all discontinuity points of f (x) of measure zero. 

By virtue of Properties 2° and 3° it suffices to prove the measur- 
ability of f (z) on the set E, =Z\R. By Theorem 3.9 there is a set 
E, of the type F, (see Section 8.2.2) contained in &, such that 
|£,|= |£, |= |Z |. By virtue of the same Properties 2° and 3° 
it suffices to prove that f (z) is measurable on E,. But E, (as a set 
of the type /,) can be represented as a countable sum of closed sets J’, 
on each of which f (z) is continuous and therefore (in view of the 
above remark) measurable. But then by Property 2° f (x) is measur- 
able on &,. , 

Remark. We stress that the continuity of a function ¢ (x) almost 
everywhere on a set & should he distinguished from the equivalence 
of f (z) on & to a continuous function. Thus the Dirichlet function 
f(x) = I, if z is rational; f (xz) = 0, if z is irrational and continuous 
at no point of [0, 1] (see Chapter 4 of [4}); but it is equivalent on 
[0, 4] to the continuous function g (z) = 0 since f (z) = g (z) only 
on the set of all rational points of (0, 1] which is countable and is 
therefore of measure zero*. 

8.3.3. Arithmetic operations on meaSurable functions. We first 
prove the following lemma. 


* The fact that a countable set of points is of measure zero follows from 
Theorem 8.8 and from the fact that the measure of a set consisting of a single 
point is zero. 
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Lemma {. (1) Jf a function f (x) is measurable on a set E, then so 
is the function | f (x) |. (2) Jf f (x) is measurable on E, and C is any 
constant, then either of the functions f (x) + C and C+f (x) is measur- 
able on i. (3) If f (x) and g (x) are measurable on EF, then the set 
Ef > g] is measurable. 

Proof, (1) It suffices to consider that for any nonnegative a 


Ellfloeaqd=Elfealu EW < —al 


and use Theorem 8.3. If, however, a <0, then E (}f | > a] coin 
cides with £ and is also measurable. 
(2) It suffices to use for any real a the relations 


El[f+Cpaj=L£ifea=C], 


E [i>+ | when C>0, 
£(C-f>a)= 
nae when C>0. 


If, however, C = 0, then C-/ (x) = 0 and is also measurable. 
(3) Let {r,} be all rational points of an infinite straight line 
(—0o, 00). It suffices to consider that 


Eli>g)]= U (EU > NE le<rpl) 


and use Theorems 8.3 and 8.6. Thus the lemma is proved. 

On the basis of Lemma 1 we prove the following theorem. 

Pheorem 8.11. If functions f (x) and g (x) take finite values on 
a set E and are measurable on it, then each of the functions f (r) ~ 
— g(x), f(z) + g (2), f (z)-g (x), and f (x)/g (x) (for the quotient 
f (z)/g (x) we require in addition that all the values of g (x) should be 
nonzero) is measurable on E. 

Proof. (1) To prove that the difference f (x) — g (x) is measurable 
it suffices to notice that for any real a the set E {f — g >a] coin- 
cides with the measurable (by Lemma 1) set & [f > g + al. 

(2) To prove that the sum f(z) -+- g (x) is measurable it suffices 
to consider that f + g = f — (—g) and that the function —g (z) 
is measurable by Lemma 1. 

(3) To prove that a product of two measurable functions is mea- 
surable we first show that the square;of a measurable function is 
a measurable function. Indeed, if a <0, then the set Eff >a] 
coincides with £ and is therefore measurable. If, however, a > 0, 
then the set Z (f? > a] coincides with the measurable (by Lemma 4) 
set E [| f |>> Val. From the measurability of the square of a mea- 
surable function and from the measurability of the sum and the dif- 
ference of measurable functions, by virtue of the relation f-¢ = 
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= 1/4 (f + g)* — 1/4 (f — g)* it follows that the product f (x) g (z) 
is measurable. 

(4) By virtue of the measurability of a product of two measurable: 
functions, to prove that the quotient f/g is measurable it suffices 
to establish that 4/g is measurable, but this follows from Theorems. 
S.3 and 8.6 and from the relation 


Elg>ainz[e<-—| when a>0, 
E|—> aj=} Elg>0] when 'a=0, 
Elg>01UzZ[e<—| when a<0. 


The proof of the theorem is complete. 

§.3.4. Sequences of measurable functions. We prove several im- 
portant statements relating to sequences of measurable functions. 

Theorem 8.12. If {fn (x)} is a sequence of functions measurable on 
a set E, then both the lower and upper limits of the sequence*™ are func- 
tions measurable on E. 

Proof. We first show that if a sequence {g, (x)} consists of func- 
tions measurable on £, then either of the functions** o (z) = 
== inf g, (x) and (x) = sup g, (x) is measurable on £. It suffices 

n 


nr 
to take into consideration the relations 


E(p<aj= U Elén<al, 


Ely>aj= U Elen >a} 


and use Theorem 8,3. 

Now denote the lower and upper limits of {/, (x)} respectively 
by f (x) and f (x). To prove that f (xz) and f (x) are measurable on E 
it suffices to notice that 7 


f(z)= sup {inf jx (2)}, 7 (x)= inf {sup fa (z)} 


and use the statement proved above. Thus the theorem is proved. 


* In Chapter 3 of [1] we have proved the existence of the lower and upper 
limits in any bounded sequence. Here we agree to assume that if a sequence is 
not bounded below (upper), then its lower (upper) limit is equal to —oo (-}-09). 

** The notation @ (x) = inf g,, (x) means that at every point x the value of 


vT% 
@ (x) is the infimum of the values g; (x), ge (x), .. . at that point. The notation 
 (z) = sup g, (z) has a similar meaning. 
nT 
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Theorem §.13. If a sequence {f, (t)} of functions measurable on 
ia set Ef converges almost everywhere on E toa function f (x), then f (x) 
dis measurable on E. 

Proof. In the case where a function {f, (%)} converges to f (2) 
everywhere rather than almost everywhere on £ the statement of 
the theorem on the measurability of f (7) follows at once from Theo- 
rem 8.12. If, however, {f, (z)} converges to f (x) everywhere on £, 
except the set /, of measure zero, then f (x) is measurable on LF \ £4 
by virtue of Theorem 8.12 and is measurable on £, as a set of measure 
vero (Property 3° of Section 8.3.2) and hence on the set £ = 
= (EF \ E,)U E, (by virtue of Property 2° of Section 8.3.2). Thus 
the theorem is proved. 

Now we introduce an important concept of convergence of a se- 
quence im measure on a given set. 

Definition. Let functions f, (x) (n=1,2,...) and f (x) be mea- 
surable on a set E and take on finite values almost everywhere on E. The 
sequence {f, (x)} is said to converge to f (x) in measure on £ tf for any 
positive number e€ 


lim|E [|f—fn| el] =0, (8.20) 


i.e. if for any positive e and 6 there is N such that when n> WV 
we have |Z (| f —f, | De) |< 6. 

HI. Lebesgue proved the following theorem. 

Theorem 8.14. Let E be a measurable set of finite measure and let 
functions f, (x) (n = 1, 2, ...) and f (x) be measurable on E and 
take on finite values almost everywhere on FE. Then the convergence of 
{f, (x)} to f (x) almost everywhere on FE implies the convergence of 
{fn (z)} to f (x) also in measure on E. 

Proof. Set A=E[|fj[=-+o], 4, =El|f, |= +ol, B= 


=ENEllim f, =fl C=H=A+4+B+ U Ap. Then under the 


N00 n= { 
condition of the theorem | C | = 0, the sequence {f, (x)} converges 
to f(z) everywhere outside the set C and all the functions f,, (z) 
and f (zt) have finite values. 
For an arbitrary ¢« >0O, sot £, =EI|f—f,|oel, Raz = 


co 
= |) E;. Then, since £, is contained in 2,, we have |, |< 
Lon 
< |, |, and to prove (8.20) it suffices to establish that | R, | +0 
as nN —> Oo. 
Denote by # the intersection of all sets R,, Ro, ... to show that 
[?, | —-]R | as n + oo. By construction 72,4, is contained in 72, 
for every mn and therefore for every 


Ri~NR= LU (Py \ Rs), 
Len 
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with the sets under the summation sign mutually disjoint. But then, 
by Theorem 8.8, for every n 


[RaN R= [RN Ravel (8.21) 
and, by the convergence of the series 
ANB) = S \RaNRasil: 


the remainder of the series (8.21) converges to zero as n — 00. So 
[Ra NR | 70 as n + oo. But by virtue of the relation { A, | = 
={|R, \R|+|A| this means that |R, |—>|R|] as now. 

Now, to prove (8.20) we establish that | R | = 0. To do this it 
is in turn sufficient to establish that R is contained in C. 

Let X) be any point outside C. Then for the arbitrary e > 0 we 
have fixed there is N (x, e) such that [f, (2) — f (x) [<e 
when n > N (x, €). But this means that when n > WN (2p, €) the 
point x, is not in #, and, the more so, it is not in R, or in the set 
R which is the intersection of all A,. 

So any point z) which is outside C is also outside R. But this 
means that # is contained in C. Thus the theorem is proved. 

Remark. We stress that convergence of a sequence {f, (x)} to 
f(z) on E in measure implies neither the convergence of {f, (z)} 
to f (x) almost everywhere on /, nor even the convergence of tin (x)} 
to f (x) at at least one point of E. It suffices to consider the example 
constructed in Section 1.2.3. The sequence {f, (x)} in that example 
diverges at each point of the closed interval {0, 1], but since every 
function f, (z) is nonzero only on the closed interval J, whose length 
tends to zero aS n —» oo the sequence {f, (x)} converges to the func- 
tion f (x) = 0 in measure on [0, 4). 

Nevertheless Frigyes Riesz™ proved the following theorem. 

_ Theorem 8.15. Let E be a measurable set of finite measure and let 
functions f, (x) (n = 1, 2, ...) and f (x) be measurable on E and 
take on finite values almost everywhere on E. Then if {f, (x)} converges 
to f (x) in measure on LE, we can choose a subsequence of {f, (x)} con- 
verging to f (x) almost everywhere on E. 

Proof. We may assume without loss of generality that /, (2) and 
f (z) take on finite values everywhere rather than almost everywhere 
on £ (otherwise the same sets A and A, would be introduced that 
were in proving the previous theorem and al] the reasoning would 


be carried out for the set E\A T A,). It follows from the con- 
n=1 

vergence of {f,(z)} to f(z) in measure on £ that, given any k, 

we can find n, such that for the measure of the set Z,=£ [|f— 


* Frigyes Riezs (1880-1956), a Hungarian mathematician. 
16-01684 
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—fu le {/k] we have [E), |< 1/2". Set, asin the proof of the previous 


theorem, #,= U £,, R= R,. Then by the property of the 
L=n n= { 


is ®] 
outer measure (sce Section 8.2.1) |R,z}= \’[E£,], so that [&,]< 
i 


i= 


< ND 1/2* == 1/2"-1. Thus [R,]—0 as noo. As in the previous 
kh=n 
theorem, we can prove that |Ryf/—~|R] as n> oo. This yiclds 
[Rj =O. 
It remains to prove that everywhere outside AR the sequence 
{7n, (z)} converges to 7 (x). Let x he an arbitrary point of ZX. Then 


x is not in the set Ry for some N=WN (zx). But this means that 
x is not in £, when k>N (x). In other words, [f (2) — fn, H< 
<4/k when k>N (zx). Thus the theorem is proved. 


8.4. THE LEBESGUE INTEGRAL 


8.4.1. The Lebesgue integral of a bounded function. A subdivision 
of a measurable set E£ is any family 7 of a finite number of measur- 
able and mutually disjoint sets £,, £., ..., E, of E adding up 
to the set £. 

To denote a subdivision of a set £ we shall use the symbol 7’ = 
={2),}fa; or T ={L,} for short. 

Consider on a measurable set /& of finite measure an arbitrary 
bounded function f (x). For an arbitrary subdivision 7 ={£,} of E 
we denote by 1’, and m), respectively the supremum and infimum 
of f (x) on a subset £; and define the two sums 


n n 
Sp= \' M,{E,| and sp= \ m,|E;| 
=| k=] 
valled respectively the wpper and lower sums of the subdivision 
LT = {Z,}. 
Note at once that for any subdivision 7 = {/),} 


For any function f (7) bounded on a set of finite measure Z both 
the set of all upper sums {.S;} and the set of all lower sums {s7} 
(corresponding to all possible subdivisions 7 = {£,} of the set 7) 
are bounded. There is therefore an infimum of the set {S;}, which 


we designate J and call the upper Lebesgue integral, and a supremum 
of {s;}, which we designate J and call the lower Lebesgue integral. 


Definition. A function jf (x) bounded on a set of finite measure E is 
said to be (Lebesgue) integrable on that set if J = J, t.e.if the upper 
and lower Lebesgue integrals of the function coincide. 
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The number 1 = I is called the Lebesgue integral of f (x) over E and 
designated 


\ f (x) de. 


E 


We discuss some properties of the upper and lower sums and of the 
upper and lower Lebesgue integrals. . 

Let us say that the subdivision T* = {Ej}, is a refinement of 
the subdivision 7 = {£,}i—, if for any i (i = 1, 2, ..., m) there 
is v (i) satisfying the inequalities 1< v (i) <7 such that E} is 
contained in £y,;j). 

The integer v (i) may turn out to be the same for different i, the 
sum of the sets £; over all i for which v (i) equals the same integer k 
is obviously equal to a set £p, i.e. 


V(i)=h 

Let us also say that the subdivision f = {E;} is a product of subdi- 
visions 7, ={E£ 2} and T, ={E7’} if T consists of the sets £; which 
are the intersections of all possible pairs of sets EP and Ef”, i-e. 
if every E; is equal to £p’{] £,”, all possible combinations of p 
and g exhausted. ; 

Obviously the product 7 of two subdivisions 7, and 7. is a refine- 
ment of either of the subdivisions 7, and 7. (any other subdivision 
of i which is a refinement of both 7, and 7, is itself a refinement 
of 7). 

The following properties of the upper and lower sums and of the: 
upper and lower integrals are valid. 

1°. If a subdivision T* is a refinement of a subdivision T, then. 
ST < STx, STx <= Sir. 

Proof. We give a proof for the upper sums (since for the lower 
sums the proof is quite similar). Let T7* = {#;}7L, be a refinement 
T ={E,}R-, and let Mj be the supremum of f (x) on a set EF (i = 
= 4, 2, ..., m) and A; the supremum of f (z) on a set £, (k = 
= 1, 2, ..., n). 

By the definition of refinement for every number i (i = 1, 2,... 

., m) there is a corresponding integer v (i) satisfying 1 < v (i) < 
< nsuch that Ef is contained in £, ;), the sum of the sets Ej over 
all i for which v (i) equals the same é satisfying equation (8.23). 
It should be added that for all i for which v (i) equals the same 
integer k 

Mi<M, (8.24) 
(for the supremum cna Subset does not exceed the supremum on the- 
whole of the set). 

16% 
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From the definition of the upper sum and from relations (8.23) 
and (8.24) we get* 


<> 


Sy.= % ~ MIET\= \ S -l. NS MPNENN< 


={ f. jen? 


SA » _ eill= S ~ Mil Ex = Sr. 


2°. For two quite arbitrary subdivisions Ty and T., sr, < Sr. 

Proof. Let 7’ be the product of subdivisions 7, and 7. Since 7 
is a refinement of cither of the subdivisions 7, and 7., by Property 1° 

St, sa, Se<Sr,. (S.25) 
From inequalities (8.25) and (8.22) it follows that sz, < Sr,. 

3°. The upper and lower Lebesgue integrals are connected by the 
relation I<. 

Prool. Fix an arbitrary subdivision 7... Since for any subdivision 
ZT, (by Property 2°) sr, < Sy7,, the number Sz, is one of the upper 
bounds of the set {s7,} of all the lower sums, and therefore the 
supremum I of that set satisfies the inequality J < S+z,. Since the 


last inequality is true for an arbitrary subdivision Ts, the number / 
is one of the lower hounds of the set {S7,} of all the upper sums 
and therefore the infimum 7 of that set salishes the condition J] < i. 


Corollary. Any Riemann integrable function is Lebesgue integn cable, 
the Lebesgue and Riemann integrals of such a function coinciding. 
Proof. Let f(z) be Riemann integrable on E =[a, b] (and there- 
fore bounded on that interval). For ‘such a function we denote by 
and J the lower and upper Lebesgue integrals and hy Tp and Tr 
the Jower and upper Darboux integrals (see Chapter 10 of {1]) 


to obtain the following inequalities** 


Ip<I<IT<Iy. (8.26) 


If a function is Riemann integrable, then for it Jp = 7p, and by 
(8.26) therefore J = J, i.e. that function is Lebesgue integrable. 


Moreover, when Jp = Tp (8.26) implies fpel= foly, ie. 
they imply the coincidence of the Riemann and Lebesgue integrals, 


* We take inte account that from (8.23) and from the sets Ef being 
mutually disjoint it follews, bv Theorem 8.8, that “ Ep te | fy |. 


VCijenk 
** Because any subdivision & -= fa, 4) into subintervals is included in the 
class of subdivisions of the set “ in the sense of Lebesgue. 
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for the former is equal to the number Jp = Ip and the latter to 
l=. 

In the next subsection we shall show that the class of Lebesgue 
integrable functions is wider than the class of Riemann integrable 
functions. This will clarify the appropriateness of introducing 
measurable functions. 

8.4.2. The class of Lebesgue integrable bounded functions. We 
prove the following main theorem. 

Theorem 8.16. Whatever a measurable set E of finite measure may 
be, any function f (x) bounded and measurable on E is integrable on it. 

Proof. Construct a special subdivision of a set E called Lebesgue 
subdivision. We denote by A/7 and m the supremum and infimum of 
f(z) on # and divide the interval [m, A‘), using the points m = 
= Yo Sy Yo --- << Yn = M, into subintervals [y,_,, yx] 
(f= 41, 2, ..., 2) and denote by § the length of the largest of the 
subintervals, i.e. set 

5= max (Yn — Yn-s)- 

k=1,2,...,7 
A Lebesgue subdivision of a set E is a subdivision T = {E£,}#-1 in 
which , E,=Elyy<f<y), Ex = Eluns<f<y,) for k= 
= 2,3, ..., 7m. 

Let Sy and sp be the upper and lower sums corresponding to 
a Lebesgue subdivision 7 and called the upper and lower Lebesgue 
sums. Notice that for any fh (A = 1, 2, ..., n) 


Yaa Sm, <= My Sp (3.27) 
where A7;, and m, denote the supremum and infimum of f (x) on 
a subset #,. Multiplying inequalities (8.27) by the measure [Z,] 


of the set £, and then summing them over all k = 1, 2, ..., 7” 
we have 


1 nr 
2 Ynalf,|SSpRSr <x —! YALE, |- 


From these inequalities we conclude that 


nN 


nr 
0<S_p—Ssy <2 YrlEn|— > Yr-s| 2x |= 
= a (Yn— Yroi LEn |< OZ}. (8.28) 
Since for any subdivision 7 we have ss; </</< Sr, from 


(8.28) we get 
0</—I<8j£}. (5.29) 
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Since 6 > 0 may be fixed as small as we please, it follows from 
(8.29) that J = J. Thus the theorem is proved. 

Remark 1. In Supplement 2 to this chapter we prove that the 
measurability of a function f(z) bounded on a measurable set E 
is not only a sufficient but also necessary condition for that function 
to be Lebesgne integrable on £. 

Remark 2, Let & (4 = 1, 2, ...., 2) be an arbitrary clement 
of a subset E). of a Lebesgue subdivision 7. A sum oy (&,, f) = 


oa y > 1 (Es): | £), | will be called a Lebesgue integral sum of f (2). 


L 2s 


Since under an arbitrary choice of points €, on the sets /), that sum 
is contained between the lower and upper sums of the corresponding 
Lebesgue subdivision 7, it follows from inequality (8.28) that 
or (&, f) (together with S; and sy) tends, as 6 — 0, to the Lebesgue 


integral J = T= { f (x) dx. 


E 
8.4.3. Properties of the Lebesgue integral of a bounded function. 


\idr= EF}. 
E 
To prove this it suffices to notice that for the function f (7) = 
both the upper and the Jower sum of any subdivision 7 of a set E 
is equal to | £ |. 
2°. If a function f (x) is bounded and integrable on a set FE of finite 
measure and a is any real number, then the function |a-f (x)] is also 
integrable on E, with 


\ [a-f(z)]dz=ca. \i (x) dz. (8.30) 
E 


E 


Proof. For an arbitrary subdivision 7 = {£,} of a set F£, denote 
the upper and lower sums of the function f (z) by S; and sp and the 


upper and lower sums of the function [a-f(z)] by SY? and sp). Then 
clearly 


(2) aS; When a20, ia | asp When 20. 


3 
asp When «<0, *T a-Sp when a<0. (5.01) 


If we denote by 7 and J the upper and lower integrals of f (7) and 


by 7 and J@ the upper and lower integrals of [a-f (z)], then it 
follows from (8.31) that 


_ a-J when aDSo, ae 
[(4) = - [(2) — 
a-f whena<0. - a 


when 2>0, 


a! ee 
oF 
) 
wee’ 


When “<0. 


By virtue of the integrability of f (z) 


I=I=\ f(x) az 
7 E 


and therefore from inequalities (8.32) it follows that for any a 


TO) = T= a. | f (2) dz. 
E 


This just means that the integral on the left of (8.30) exists and that 
equation (8.30) is true. 

3°, If either of the functions f, (x) and f, (x) is bounded and integra- 
ble on a set of finite measure E, then the sum of the functions 
{f, (v) + fe (x)] is integrable on E, with 


Vf @) +f de= ff (@az t+ Vy. (2) de. (8.33) 
E E E 
Proof. Set f {z) = 7, (z) + f. (x) and let 7 ={E,} be an arbitrary 
subdivision of #. For the function f (z) denote the supremum and 
infimum on a subset £;, by AT, and m,, the upper and lower sums of 


T by Sy; and sz, and the upper and lower Lebesgue integrals by £ 
and /. Similar quantities for the functions /, (z) and f, (x) we denote 


by the same symbols as for f (x) but with the superscripts (4) and 
(2) respectively. 

Notice that the supremum (infimum) of a sum is not greater (not 
less) than the sum of the suprema (infima) of the summands. It follows 
that for any & 


mY +mP<m,<M,< MY + iM? 
and therefore for any subdivision 7 

SP + 9 Sor S57 CSP + SP. 

From these last inequalities it follows then that 

M4 JOST CIO + JO, (8.34) 
Since (dus to the integrability of 7, (x) and f, (z)) 

IO =TM = \f(x)de, T@=Fe= \ fp (x) ae, 


E E 
from (8.34) we get 


I=T= Vi(aydet f fo(a)az. 
E E 
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3ut this just means that the integral on the left of (8.33) exists and 
that equation (8.33) is true. 

Corollary. Properties 2° and 3° lead directly to the linear property 
of the integral: if either of the functions f, (x) and f. (z) is bounded and 
integrable on a set of finite measure E and if a and B are arbitrary 
real numbers, then the function \a-f, (x) + B-f. (z)] is integrable on E, 
with 


lef + Bf dz=a- (Ale) dz+8 [ie(2) ax, 
E E 


~_ 
oo 
out 


4°. If a function f (x) is bounded and integrable on either of the dis- 
joint sets of finite measure E, and E£., then f (x) is integrable on the 
sum E of the sets E, and E. as well, with 


\ftzydz= Vf (zyazt Vs (ayaz. (8.35) 
E 


£; EZ; 


This property is usually called the additivity cithe integral. 

Proof. Notice that the union of an arbitrary subdivision 7, of 
the set £, and an arbitrary subdivision 7, of £. forms a subdivision 
Tf of the set £ = £, UJ £.. Denote the upper sums of f (xz) corres- 
ponding to 7,, 7, and T by Sy,, Sr, and Sy respectively, and the 
lower sums of f (x) corresponding to 7,, T, and 7 by sry,, sz, and 
Sp respectively. Then clearly 


(8.36) 


Denote the upper and lower integrals of f (x) over E, by I and I®, 


those over E, by J@ and J@ and those over E by J and J. 


From equations (8.36) and from the fact that the supremum 
(infimum) of a sum is not greater (not less) than the sum of the supre- 
ma (infima) of its terms we conclude that 


IME IMcT<I<IM+ Je), (8.37) 


Sp=S7,+ S7,, Sy = ST, 7 $1,: 


Since (due to the integrability off ‘z)on E, and on £y)J@ = T® = 


=| f@a, 19 =7o = f(z) dz, from (8.37) we get 

Ey Es 

T=T= \ f(x) de+ (f(2)az. 

- Ey Ey 
But this proves that the integral on the left of (8.35) exists and that 
equation (8.35) is true. 


o°. If either of the functions f, (z) and}, (x) is bounded and integra- 
ble on a set of finite measure E and tf everywhere on that set f(r) > 
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> fe (x), then 
(fi (z)dx> | fo (a) de. (8.38) 


E E 
Proof. Since all the lower sums of the function F (x) = f, (%) — 


— jf, (x) are nonnegative, I => 0. It follows that F (x) dx = 
E 


= f, () dz — fo (zc) dx >0 (the existence of the integral 
E . 


E 
and the above equality follow from the linear property already 
proved). This proves (8.38). 

8.4.4. The Lebesgue integral of a nonnegative unbounded functiow 
and its properties. We now proceed to define the Lebesgue integral 
for the case where the measurable function f (x) is not bounded. We 
first assume f (xz) = O everywhere on a set of finite measure E. 

For any V > 0, we set 


(f)x (z) = min {N, f (2)}, : (8.39)}- 
Ty (f)= | Aw (2) dz. (8.40) 
E 


Notice that for any function f (x) measurable on EF the function 
(8.39) is also measurable* and therefore the integral (8.40) exists. 
Note also that from (8.39) and (8.40) it follows that Jy (f) increases 
as IV increases. 

Definition. If there is a finite limit I y (f) as N +o, then f,(x) 
is said to be (Lebesgue) summable on a set E and the limit is said 


to be the integral of f (x) over E and designated f (x) dz. So by defini- 


E 
tion 


\ (2) dx= lim Iy (f)- 
i NV~+oo 


We show that if a function (zx) nonnegative on a set E is summable 
on that set, then f (x) may tend to +00 only on a subset of E of measure 
zero. Indeed, set E, = E (f = +00] and consider that from (8.40) 
and (8.39) (by Properties 4° and 5° of Section 8.4.3) we have a chain. 
of inequalities 


Ty (f)= \ (x (x) dz \ (fx (x) de> | N deN| Lol. 
E Eo Eo 
* Since for any real a the following set is measurable: 


_sE lf >a} when ac QN, 
El(w > a= an empty set when a> WN. 
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But it follows from /, (f) 2 N | £, | that the assumption | £, | > 
> 0 would result in the limit lim I (f) being equal ta +o, 


It should be added that any junction f (z) is summable on a set of 
measure zero. (This fact is obvious.) 

Turning to the general properties of summable functions, first 
note that Properties 2° to 3° established in Section 8.4.8 for bounded 
integrable functions hold for nonnegative summable functions.* 

By way of illustration, we prove Property 3°. From (8.39) we have 
at once the following inequalities: 


(Ai) wee (@) > (fe) nte (2) S (CA + fe) » (7) S (fi) w (@) (fa) x (2) 


true for any N > O at any point z of a set E. Integrating these 
inequalities over the set £** we establish Property 3° for arbi- 
trary nonnegative summable functions f, and fa. 

Proof of the remaining Properties 2° to 5° for such functions will 
he Ieft to the reader. 

We now proceed to discuss two other fundamental properties of 
arbitrary nonnegative summable functions. 

Theorem 8.17 (complete additivity). Let a set E be a sum of a 
countable number of mutually disjoint measurable sets E), i.e. let E = 


= u Ly. Then the following two statements are true. 


I. 7, a nonnegative function f (x) is summable on a set E, then f{ (x) 
is summable on evert y set £,, with 


ox 


Vi (x) dxz= > { f(x) dz. (8.41) 


E k=] E,, 


II. Jf a function f (x) nonnegative on a set E is summahle on every 
sel Fy and if the series on the right of (8.41) converges, then f{ (z) is 
summable on E too and equation (8.41) is true for it. 

Proof. (1) We first prove Theorems ] and II for the bounded 
nonnegative integrable function f(r). Let there be a_ cons- 
tant Jf such that f(r7)<4/7 everywhere on FE. Set &,= 


ca fe) on 
== (J FE, and notice that by Theorem 8.8 [/,| = » {[LZ,|-9 
t=n~+t and | 


(as n—- oo). But then by Properties 4°, 5°, and 1° 
h 
(f(xjdz— > | f(x) de= | f(z)dz<Mt | dzr= M{R,|—>-0 
E 7 it, 


Lot Ey. R, 


* The constant @ in Property 2° must be nonnegative here. 
" We use Properties 3° and 3° for bounded integrable functions. 
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(as 2 — oo). The last relation proves Theorems I and II for the case 
of the bounded integrable function. 

(2) We now prove Theorem I for an arbitrary nonnegative summa- 
ble function. The summability of f (z) on every £,, follows at once 


from the inequality I x (x)di< (f) x (z) dx and from the 
E;, E 

fact that the integral on the left of the inequality is nondecreasing 

in A’. It remains to prove equation (8.41). Using what was proved 

in (1) above and the inequality (f), (7) <f (x) we get 


V(x (z)dz= >) | (ye (dz< St | f(a) az. (8.42) 
£ 


h=={ E;, k=] E}, 
Proceeding in the last inequality to the limit as VY’ - oo we have 


\ f(z) in<d \ f(@)a (8.43) 
p 


k=1 E;, 


On the other hand, by the properties proved in Section 8.4.3, for 
any m 


[Ow aaz= >] Dx @ae>D f (Nw (oas 
Lc 


h=1 Ep h=i Dp 


and making in the last inequality first ’ approach oo and only then 
m approach co we obtain the inequality 


[reared \ f(2) dz 


h=1 Ep 


which, in conjunction with (8.43), proves (8.41). 

(3) We finally prove Theorem I] for an arbitrary nonnegative 
summable function. Notice that it suffices to establish only the sum- 
mability of f (x) on a set £ (for equation (8.41) will then follow from 
Theorem I we have already proved). 

But summability of f(z) on E follows at once from inequality 
(8.42) and from the convergence of the series on the right of the in- 
equality. The proof of the theorem is complete. 

Theorem 8.18 (absolute continuity of the integral). If f (x) is 
nonnegative and summable ona set i, then, for any positive ¢ there is 
a positive number 6 such that whatever a measurable subset e of E with 
measure |e | less than 6 may be, 


\ (x) dr<g. 
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Proof, (4) First let the nonnegative function f(r) be daunded, 
icc. Jet there be J/7 such that f (7) < WV. Then (by the properties 
established in Section 8.4.3) 


1 J (2) dz<M \dr= Miel< Mi<e when 6< 


¢ € 


(2) We now prove the theorem for an arbitrary nonnegative sum- 
mahle function f (7). On fixing an arbitrary e > 0 we may choase 
(on the basis of the definition of summahility) Vi = N (e) such that 


Vis (z)—(f)x (2 dz<z. (8.44) 


E 


Using (8.44) and the inequality (f)y (z) << N we get 


( rayde= (if @)—Ws de + [Ny le) <$4N lara 


6 ce ¢ e 


— st N. lel< + NS<e, 


provided 6 < e:QN (ec). Thus the theorem is proved. 

In conclusion we show two more properties true only for nonnega- 
tive summable functions. 

Condition for a nonnegative summable function to be equivalent 
to zero: If f (x) is nonnegative, measurable and summable ona set E£ 


and if | } (z) dx is equal to zero, then f (x) is equivalent to an identical 
E 
zero on £. 
Proof. It suffices to prove that the measure of the set £ [f/f > 0] 
is equal to zero. We first show that for any a > 0 the measure of a set 


E, = E \f > al) is equal to zero, Indeed, if the measure | £, | were 
positive, we should obtain the inequality ; f(z) dr > Ai (x) de > 
E 


> aE, >O9 contradicting the condition \ f (c) dx = 0. Now it 
i 
remains to notice that & [f/>O)= U EF [f> Wh) from which 
= 


it follows that JEToO|;< NYE Yt tk} | = 0. 
re | 


Majorizing criterion for the summahility of a nonnegative measur- 
able function: Jf f,; (rz) is nonnegative and measurable ona set I and 
jo (x) is summable on E and if fy (x) < fo (7) is true everywhere on £, 
then f, (x) is also summable on E. 


Ch. 8. The Lebesgue Integral and Measure 203 
i 


Proof. It suffices to notice that 
(tidy (2) de< | (fa) (2) dz< | fo (2) dz 
E E E 


and take into account the fact that the integral on the left of the 
last inequality is a nondecreasing function of JV. 

8.4.5. The Lebesgue integral of an unbounded function with 
arbitrary Sign. Suppose now that the measurable function f (z) is 
not in general bounded on a set £ and takes on values of any sign 
on that set. 

We introduce into consideration two nonnegative functions 


f(2)= 5 (If (ZF (2) and f° (2) = $(IF (2)I—F (2) 


the frst, of which, f* (xz), coincides with f (x) on the set E [f > Ol, 
and is equal to zero on E [f <0], and the second, f- (x), coincides 
with —f (z) on E (f < 0] and is equal to zero on FE [f > OJ. Obvious- 
ly, fa) + @) =1f@b ©) —f @ = ©. 

Definition. A function f(z) is said to be summable on a set E if 
either of the nonnegative functions f* (xz) and f- (xz) is semmable on 


that set, the difference of the integrals { {* (x) dx — | f- (x) dz 


E E 
being called the Lebesgue integral of f(z) over E and designated 
lf (2) ae. 
E 


So by definition 
|f(a)de= {f+ (2)d2—[f-(a) az. (8.45) 
E E E 


, The lerm “integrable function” is often used instead of “summahble 
gee 
He collection of all functions summable on a set & is usual] 
feugnated L (E) ie L' (E). The notation f (2) € L (£) means iat 
an a es 
Heated Be 9 on £ (£), i.e. is a function measurable and 
Stress thal a function f (x) measurable on E is summabl 
a if the function | f (x) | is summable on E. aes 
fine if { (x) is summable on E, then by definition either of the 
a at (x) and f- (z) is summable on EF and therefore so is the 
lioweyer tree f{* (x) + f- (x) that is equal to | f (x) J. If, 
’ x) | is summable on E, then from the measurability 
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of cither of the functions f* (x) and f~ (x) and from the inequalities 
7 (zZ)< 17 (2) 1-7 (e) <I FP (x) | it follows, by virtue of the major- 
izing criterion for the summability of a nonnegative measurable 
function (see this at the end of Section 8.4.4), that either of the 
functions f* (7) and {7 (x) is summable on £, and this exactly means 
that f(z) is summable on £. 

Thus, for the Lebesgue integral (in contrast to the Riemann inte- 
gral) the integrability of f (vz) is equivalent to that of | f (z) |. 

We now consider the properties of arbitrary summable functions. 

Note immediately the validity for arbitrary summable functions of 
Properties 2° to 3° established for bounded integrable functions in 
Section 8.4.3 and for nonnegative summable functions in Section 
8.4.4. That these properties are valid for arbitrary summable func- 
tions follows immediately from equation (8.45) and from the validity 
of the properties for nonnegative summable functions. 

Finally, for arbitrary summable functions the properties of com- 
plete additivity and absolute continuity of the Lebesgue integral still 
hold (proof of these properties for nonnegative summable functions 
made the contents of Theorems 8.17 and 8.18 of Section 8.4.4). We 
give a formulation and some brief hints concerning the proof of 
these properties. 

Theorem 8.17* (complete additivity). Let a set E be a sum of 
a countable number of mulually disjoint measurable sets E),, i.e. let 


E= _U Ey. Then the following two statements are true. 
= | 


T. Jf f (x) is summable on E, then f (x) is summable on every sel Ej, 
equation (8.41) being true. 

I]. Jf f (2) is measurable and summable on every set E,, and if the 
series 


Dd | ifteyiaz 
h=1 E}, 


converges, then f (x) is semmable on E and equation (8.41) is true. 
To prove Theorem J it suffices to apply Theorem 8.17, J, to the 
nonnegative functions j* (z) and f- (rz) and use equation (8.45). 
To prove Theorem J] it suffices to consider that by Theorem 8.17, 
JI, the function | f(z) | is summable on £, But then so is f (7) and 
equation (8.41) holds by virtue of Theorem J already proved. 
Theorem §.18* (absolute continuity of the integral). If | (x) 
is summable ona set E, then for any positive ¢ there isa positive num- 
ber & such that, whatever the measurahle subset e of EF with measure 


le | less than 5, the inequality | [ f(x) dx |{<e holds. 


c 
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To prove this it suffices to apply Theorem 8.18 to the nonnegative 
function {7 (z) | and use | | f(z) dz |< | Jf (2) Lae. 


a ec 
8.4.6. Passage to the limit under the Lebesgue integral. 
Definition. It is said that a sequence {j, (z)} of functions summable 
on a set £ converges to a function f (x) summable on the same set in 


L (E) if 


lim { |, (2)—f (z)|dx= 0. (8.46) 
moe f 


The convergence of {7, (z)} in Z (£) ensures the possibility of a term- 
by-term integration of {7, (z)} on ZY, for it follows from (8.46) 
that lim ) i, (2) de = \i (x) dr. 

Xotice that ij a sequence {7, (x)} of junctions measurable and sum- 
madle on a set E converges in L {£) to a junction Ff (z) measurable and 
summable on E, then iF (x)} converges to 7 (x) also in measure on £. 

Indeed, on fixing an arbitrary e > 0 and denoting by £, a set 
Etlf—#, [>] we have 


by Fare y 


fn (z)—F (aided | 7, (2) —F (@lezSel Es}. 


so that it follows from (8.46) that }£, |] -~0 as 2m ~o~. 

Thus convergence in measure on £& is weaker than convergence in 
L (£) (and, as has already been established, weaker than convergence 
almost everywhere on £). 

We prove, however, that under additional assumptions conver- 
gence in measure on £& will imply convergence in £ (£). 

Theorem 8.19 (Lebesgue's theorem). If a sequence {7,, (z)} of fune- 
tions measuradble on a set E converges in measure on E to a function 

f(z) measurable on E and if there is a function f (x) summable on E 
such that jor all n and almost ail the poinis of EF we have fi, (a) i< 
< F (2), then {7, (x)} converges to f (x) in L (E). 

~ Proof. We first show that the limit function j 7 (z) itself satisfies 
almost everywhere on E the inequality | 7 (z) |} < F (a). It follows 
from Theorem §.15 that we can choose a subsequence {f,, (2)} 
(EF = 141, 2. ...) of {4 (z)} converging to 7 (z) almost everywhere 
on E. Proceeding in the inequality |7,, (z) |< F (2) to the limit 
as k —- oo we get | f(z) | < F (zx) for almost all the points of E. 
From the inequality we have proved and from the majorizing crite- 
rion for the summability of a nonnegative measurable function (see 
this at the end of Section &.4.4) it follows that f (x) is summable on F. 
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On fixing an arbitrary ¢€ >O and denoting by E£, a set 
Ef — fn |e] we have* 


[1f(2)—fa(a)lar= (f(x) fn (2)lde+ 


s 


E 


E , 
+ \ [fl)—fn(x)ldr<2\ P(a)dr+elZI. 
E.E, En 


From this inequality and from the arbitrariness of e > O it follows 
that to establish the convergence of {f, (z)} to f (x) in L (£) it suffices 


to prove that lim [ F (x) dx = 0; but this follows immediately 
New 8 

from Theorem 8.18* on the absolute continuity of the integral and 

from the fact that under the hypothesis | £, | +0 as n — oo. Thus 

the theorem is proved. 

Corollary (Lebesgue's theorem on passage to the limit under the 
integral). Ifa sequence {f, (z)} of functions measurable on a set E 
converges almost everywhere on E to a limit function f (x) and if there 
isa function F (x) semmable on E such that for all n and almost all the 
points of If we have | f, (x) |< F (x), then f (x) is summable on E and 


lim |, (x) dx= (f(z) dz. (8.47) 
nee E 


Proof. It follows from Theorem 8.13 thatf (z) is measurable on E. 
Hence it suffices to notice that convergence almost everywhere on £ 
implies (by virtue of Theorem 8.14) convergence in measure on £; 
and to use Theorem 8.19, 

Theorem 8.20 (B. Levi's theorem). Let every function f, (x) be 
measurable and summable an a set FE and let fy (2) < fn4y, (x) for all x 
and for almost all the points of E. Suppose further that there is a con- 


stant AI such that | [ fn (x) dx |< M for alln. Then for almost every 


E 
point x of & there is a finite limit lim f, (x) = f(x), the limit 
N--co 
function f (x) being semmable on FE and equation (8.47) holding. 
Proof. We may assume without loss of generality that all f, (z) 
are nonnegative almost everywhere on £& (otherwise we should take 
the nonnegative functions g, (r) = f, (x) — f, (x) instead of f, (2)). 
Since {f, (z)} is nondecreasing almost everywhere on E, a limit func- 
tion f (x) is defined at almost all the points of /& taking at them 
either finite values or values equal to --oo. [If we prove that limit 
function is summable on £, then it will follow that f(z) has finite 
values almost everywhere on £, i.e. it will follow that {f, (z)} con- 
verges to f (x) almost everywhere on /, and this and the inequality 


* We consider that [f/, (2) —f (2) | << 20 (z) almost everywhere on F. 
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fn (xt) <f (xz) (almost everywhere on £) will lead, by virtue of the 
corollary of the previous theorem, to equation (8.47). 

So, to prove the theorem it suffices to establish that the limit 
function f (x) is summable on E£. 

Notice that for any N > 0 the sequence* {(f,) ,(x)} converges to 
(7) (z) almost everywhere on £, the bounded function (f) x (z) 
being summable on £ and (f,) x (2) < (f) yw (x) for all n and almost 
all points of £. 

This ensures the applicability to {(f,) v (z)} of the corollary of the 
previous theorem, by which 


Lim | (fn)x (2) d= | (Ay (2) dz. 
2°" E E 


From this and from the inequality** 


[fn (2) de> | (fa) xe (2) da 
E E 


we conclude that 


lim | fn (x) dx | (f)x (2) a, 
and since \ fn (2) dx < AT for all n, so is 
E 


\ (fx (x) dx<ia. (8.48) 
B 


From inequality (8.48) and from the fact that the integral on the 
left of (8.48) is nondecreasing in NV it follows that there is a limit 


lim { (f)y (2) dz, 
N—-00 f 


which just means that f (xz) is summable on Z. Thus the theorem is 
proved. 
We now formulate Theorem 8.20 in terms of a functional series 
(the theorem is widely used in this form). 
If every function u,, (x) is nonnegative almost everywhere on a set E, 
is measurable and summable on it, and if the series 
oS 


Y [un (2) az 


n=1E 


* Recall that for any N > 0 and for any function F (x) we set (F), (z) = 
= min iV, F (x)}. . 
** This inequality follows from (f,)y (z) = min {N, f, (z)). 


17--01684 


%} 
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converges, then so does almost everywhere on E the series 


ce 


vv 


a= 


n=] 


the sum § (z) of the series (8.49) being summable on FE and satisfuing 
the condition 


u,,(Z), (5.495 


a2 


\s {z) dz == »Y (a, (x) dx. 
E 


=iE 


Theorem 8.21 (Fatou’s theorem). If a sequence {f,, (x)} of functions 
measurable and summable on a set E converges almost everywhere on E 
to a limit junction f (z) and if there is a constant A such that 


lin (z) | dz <A for all n, then ffx) is summable on E and 


lf (z) |dz <A for it. 


TY aeons een, 


Proof. We introduce into cunsideralion functions § g,, (z) = 
= ini | 7; (xz) |* and notice that every function g, (x) is nonnegative 


71 
and measurable** on £ and that the sequence {g, (zj} is nondecreas- 
ing on E and converges to | f (x) | for almost all points of £. In 
addition, for any n everywhere on £ 


Bp (Z) <S | fp (2)! (5.50) 


from which (by the majorizing criterion for the surmahility of « 
nonnegative measurable function, see this at the end of section 5.4.4} 
it follows that g, (z) is summable on E. Applying to {g,, (xj} Theo- 
rem 6.20 we get 


lim |g, (2) dx= | {f (2)Idz. (8.51) 


Tt 20 FE E 


wal 


a 


Since by (8.59) \ £2, (x) dr< [in (z) | de <A for any n, from 
E Cd 
(8.51) we get \ 


f(z) | dx <A. Thus the theorem is proved. 
E 
8.4.7. Lebesgue classes LE? (££). Recall that a Jinear space J is 
said to be normalised if the following two requirements hold: {4} 
there is a rule by which each element f of FR is assigned a rca] number 
called the norm of that element and desiyneted |i f {lp, (2)j;the role 
catishes the following three axioms: 


* The notation means that for every =z the value of ¢,, (r) ie the infimum of 


fn zits Sfnar fz) dl, .--. 
"ee Th hat ¢, x) is reeasurable on FE follows from Theorem 6.12. 
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1°. if iin > 9, if 7 0", Tf lle = 0, if f = 0. 

2°. WAfile = 1AI-Ilf lla for any element f and any real number A. 

3°. For any two elements f and g the so-called triangle inequality 
(f+ ¢ella<lf lle + Ul g Ile is true. 

We shall consider in a normalized linear space & an arbitrary 
sequence of elements {f,}. 

Definition 1. A sequence {f,} of elements of a normalized linear 
space R is said to be fundamental if 

lim lfm—Tn ll; = 0. 


men 
TL->00 


Definition 2. A sequence {f,} of elements of a normalized linear 
space R is said to converge in R to an element f of R if 


lim {I fx—f ln = 0. 


Such convergence is sometimes called convergence in norm or strong 
convergence in R. 

It is easy to prove that any sequence of elements {f,} converging in R 
is always fundamental. Indeed, if there is some element f such that 
(lf, —f |p 20 as » — oo, then from the triangle inequality 


lfm — fn lin Ifm —flle +f — fa Ile 
it follows immediately that 


lim I| fm—T lr = 0. 
mSn j 
N= 00 


The question naturally arises as to whether any fundamental 
sequence of elements {/,} is convergent in to some element f of R. 

Definition 3. A normalized linear space R is said to be complete if 
any fundamental sequence of elements { fn} of R converges in R to some 
element f of R. 

We shall discuss here an important class of normalized linear 
spaces due to Lebesgue and prove the completeness of{ these spaces. 

Let a real number p satisfy the condition p > 1. 

Definition 4. We shall say that a function f (x)|belongs to the class 
(or space) L? (£) tf f (x) is measurable on a set} E and | f (x) |? ts sum- 
mable on that set**. 

It is easy to see that for any p > 1 the class L? (£) is a normalized 
linear space if the norm is introduced in it using the relation 


Nf Wee = if lp = ( ) lf (x)|? dx)”, 
' fE 


po eee 
* Q stands for the zero element of a linear space 2. 
xt We shall not distinguish between functions equivalent on E, regarding 


them as a single element L? (£). 


{7* 
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The linearity of such a space is obvious. It is easy to verify Axioms {° 
to 3° using the definition of a normalized space. Axiom 1° follows 
immediately from the condition for a nonnegative summable function 
to be equivalent to zero (see this at the end of Section 8.4.4). Axiom 
2° is quite obvious. Axiom 3° is obvious for p = 41, and for p > 1 
it follows from the \Jinkowshki* inequality 


({ If (2) +2 (2) Par)" <{ | Is (ay? dz)" +. ( Ig (2)? dx) 
E 


5 Os E 


established in Supplement { to Chapter 10 of [4]. 

We now prove the following theorem**. 

Theorem 8.22. For any p> 1 the space L” (E) is complete. 

Proof. Let {f, (z)} be an arbitrary fundamental sequence of ole- 
ments of the space ZL? (£). Set 


Ey, = Sup \| Im— fn Ip 
man 


(the supremum of [| fr — fn Ilp is taken over the sot of all mm satis- 
fying the inequality m >). It follows from the condition for a 
sequence {f,} to be fundamental that ¢, —-0 as n + oo. It follows 
that we can choose a subsequence of n, (k = 1, 2, ...) such that the 
series** * 


co 

— en, (8.52) 
converges. 

From the Holder**** inequality established in Supplement 1| to 
Chapter 10 of [4] 


( \ lf (x)-¢ (x)| dz<( \ | f (x)? dx)" \ le (x)? dz)" 
E ‘ u 
(p>1, q=—2 7] it follows when p> 1 


* In that supplement the Minkowski inequality has been established for the 
case of the Riemann integral. In the case of the Lebesgue integral it suffices to 
establish this inequality only for bounded functions f(z) and g¢ (r), and for 
such functions the proof is similar to that for the Riemann integral (it is suffi- 
cient to consider a Lebesgue subdivision of the set E). 

** In a special form (relating to the so-called trigonometric system) this 
theorem was proved in 1907 by F. Riesz and independently by Fisher. In 1909 
Hermann Weyl noticed that the relation to the trigonometric system is uncs- 
sential and gave the more general formulation (for p = 2) presented here. 

*** It suffices to take nm, sttch that e,, < 274. 

sett In that supplement the Hélder inequality has been established for the 
Riemann integral. In the case of the Lebescue integral it suffices to establish 
this inequality only for bounded functions { (z) and g¢ (x), but for such functions 
the proof is similar to that for the Riemann integral (it is sufficient to consider 
a Lebesgue subdivision of a set EF), 
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[ [7np., (r)— Ta, (x) \dz< || Ina, (z) —Tn, (x}\[p- ( { 42 dz | i/q < 
E 


E 

poi 
SEn,° [E'| P ’ 
and from the last inequality and the convergence of the series (8.52) 
it follows that the series* 


D | Wag. @)— fay, (az (8.53) 
h=1 E 
converges. From the convergence of (8.53) and from Theorem 8.20 
(see the formulation of the theorem in terms of a series) we conclude 
that the series 


a, Urner @) — Fangs 


converges almost everywhere on £, and therefore so does the series 
co 
° “I 
fy (2) + 31 Ufngay @)—fny, @- 


But this means that the Ath subsum of that series equal to fray (2) 


converges almost everywhere on FE to some function f (z). Further, 
since || fm (tT) — fn, (2) lp em for any m and any n,m and 
since [fm (t) — fn, (t)] > [fm (x) — f (z)} as k + co almost every- 
where on E£, by Theorem 8.24 (Fatou) |] 7m (2) — f (2) lp tm 
{for any mz) and this just means that the sequence {7,, (x)} converges 
in Z* {Z£) to f(z). Thus the theorem is proved. 

8.4.8. Concluding remarks. The central point of the Lebesgue 
theory is closure under the operation of proceeding to the limit in the 
theory of measurable sets (Theorems 8.5 and 8.8). in the theory of 
measurable functions (Theorem 8.43), and in the theory of the integral 
(Theorem 8.22). 

We discussed everything for the case of a single variable. In the 
case of mz variables the scheme of constructing the theory remains 
the same, but instead of the interval (a, 5b) we should take an open 


n-dimensional parallelepiped [] (@, <2, < dx) (allowing —co 
k=} 


values for a, and -+co values for b,) to be the original (primary) 
set. In nm dimensions the only qualitatively new feature of the theory 
is the so-called Fubini theorem on reduction of the n-fold multiple 
Lebesgue integra] to an iterated integral of lower multiplicity. We 
shall skip this theorem. 


* We need not use the Hélder inequality when p = i, for the series (8.53) 
coincides with (8.52). 
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SUPPLE/PAENT 1 


THE NECESSARY AND SUFFICIENT CONDITION 
FOR RIEMANN INTEGRABILITY 


Without loss of generality, consider functions defined on a closed interval 
[G, {1}. For every such function f (z) we introduce the so-called Batre functions 
m (zr) and SJ (r) corresponding to the upper and lower limits of f (r) at every 
point z*. So by definition 


m(z)=lim f(y), Af (z)=lim f (). 
yor ywx 
Notice that Baire functions may be defined in another way: 
m(z)= lim {int {()], A (@)= lim [sup f(y), 
6-040 v(x) 6-040 D4(x) 


where rg (z) is a 6-neighbourhood of a point z (in case z is an end point of [0, 1] 
the right- or Jeft-hand 6-balf-neighbourhood of x respectively should be taken 
instead of the 5-neighbourhood). 

Obviously the function f (x) is continuous at a point xq tf and only if f (ro) = 
= mM (19) = AI (xo). 

Theorem 8,23. For a function f (r) bounded on {0, 1] to be Riemann integrable 
on {0, 1] it is necessary and sufficient that f (z) should be continuous almost everu- 


where on [0, 1). 
k—~—4 I; 
2A? ¥} 
(i = 1, 2, 3,..., 2%) and introduce into consideration two step functions q, (z) 
and , (z), setting them equal to inf f(y) and sup f(y) at every interval 
ri a” 
Am) and to zero at points 4/2" (k = 0, 1, ..., 2"). Then on taking for every 


point z = k/2" a sequence of intervals A‘ contracting to z we get 


lim gy (z)=m (z), “lim @, (z)-=M (z). (8.54) 
n-0o n-+00 


Proof. For any n, divide [0, 1} into 2% intervals Ain =( 


The convergence (8.54) thus occurs almost everywhere on [0, 1]. Since the step 

functions ¢n (z) and ,, @) are a fortiori measurable on [0. 1), it follows from 

(8.54) and Theorem 8.13 that so are the Baire functions m (zr) and Jf (z). 
From (8.54) we see that almost everywhere on [9, 1] 


lim [Mn (7)—@n (7)] = MV (=)—m (2). 

Noe 
From the last relation it follows by virtue of the corollary of Theorem8.19 
that** 
{ 


[Dy (7) ~ Qn (2)] dr = ) [Af (z)—m (z)] dz. (8.55) 
0 


ey om 


lim 
n-co 
* Jn case the functions (x) is not bounded below (above) in an arbitrarily 
smal] neighbourhood of z, we set the Jower (upper) limit of f (z) equal to 
—oo (oc) at that point. 
** sJenceforth al} integrals in Supplement { are understood in the sense of 
Lebesgue. 
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It remains to notice that 


1 1 
| Op (z) dz=Sp, | Mn (z) dz = Sns (8.56) 
0 9 


where S, and s, are respectively the upper and lower Darboux sums corresponding 
to the subdivision (A{”) (k = 1, 2, 3, -- + 2%) 
From (8.55) and (8.56) it follows that 


{ 
lim (S_—sp) = \ [M (2) —m (2)] dz, 
Tl» CO 0 


so that (by virtue of Chapter 10 of [4]) a necessary and sufficient condition for 
Riemann integrability reduces to the equation \ [Mf (x) — m (z)] ex = 0. But 


0 
the last equation, by virtue of the condition for a nonnegative measurable and 
summable function to be equivalent to zero (see Section 8.4.4), means that 
M (x) — m (x) = 0 almost everywhere on [0, 1]. Thus the theorem is proved. 


SUPPLEMENT 2 


THE NECESSARY AND SUFFICIENT CONDITION 
FOR THE LEBESGUE INTEGRABILITY OF 
A BOUNDED FUNCTION 


Theorem 8.24. For a function f (x) bounded on a measurable set E to be Lebesgue 
entegradic. on that set it is necessary and sufficient that the function should be measur- 
able on &. 

Proof. Proof of the sufficiency makes the content of Theorem 8.16, it is 
therefore necessary to prove only the necessity. 

Let a function f (z) be bounded and Lebesgue integrable on a measurable 
set E. This means that the upper and lower Lebesgue integrals of that function 
are equal and there is therefore a sequence of subdivisions 7, = {E{™} of E 


such that the corresponding sequences of the upper sums {S,,} and the lower 
sums {s,} satisfy the condition 5S, —s, <1/n, each subsequent subdivision 
T,, = {ER} being a refinement of the previous subdivision 7,_; = {£}7%}. 


(To construct such a sequence of subdivisions it suffices wherever necessary to 
take a product of subdivisions to be introduced). 
Recall that by definition 


Sp = >) AMERY |, sn m{P- AM, 
R R 


where M{P and m}™ are respectively the supremum and infimum of f (z) on a se 
Ey, . 

We define two sequences of functions {f, (z)} and {f, (z)} by setting f, (z) 
equal to Mi) on EM), and fn (z) equal to mi") on EM), : 


264 Fundamentels of Mathematical Analysis 


It is obvious that for every n the two functions f, (x) and fy (%) are measur- 
able on E (for these functions are linear combinations of characteristic functions 
of measurable sets E}P). 


It is also obvious that the sequence {/,, (r)} is nonincreasing and (in (x)) is 
nondecreasing on £, with 

fn (2) <f (2) <Tn (2) (8.57) 
for any n at each point of E. Set f(z) = lim tn (7), f/@) = lim fr (2). Prom 
(8.57) we conclude that mv =~“ 

f(z) </@) <T. (8.58) 


f (x) and f (z) being measurable on £ by virtue of Theorem 8.13. 
From Theorem 8.20 (B. Levi) we get 


lim | (fn (z)—fn (z)] dz= | Uf (z)—f (z)] dz. (8.59) 
Tl-+ 00 E E 


From the definition of the functions /, (z) and fy (z) it follows that 
[ lin (2) ~f, (2) dz = S, —s,, with lim (S, —s,) = 0 by construction. 
~ Noo 


E 
By virtue of (8.59) this leads to the equation \ [f (x) — f (z)) dz = 0. 


From the last equation and from the nonnegativity and measurability of the 
function [f (x) — { (z)j it follows by virtue of Section 8.4.4 that f (7) — {[() = 
almost everywhere on E£. Consequently, by (8.58) f (x) = f (x) = f f (x) almost 
everywhere on £, and since f (x) and } (z) are measurable on £, by Property 4° 
of Section 8.3.2 so is f (z). Thus the theorem is proved. 


CHAPTER 9 


INTEGRALS DEPENDENT 
ON PARAMETERS 


In this chapter we study a special class of functions characterized 
by a common name of “parameter-dependent integrals’. We may get 
an idea of these functions if we integrate a function of two variables- 
x and y with respect to x with every y fixed. As a result we shall 
obviously obtain a function dependent on the parameter y. 

The questions naturally arise as to whether such functions are- 
continuous, integrable and differentiable. These questions are to be- 
studied in the present chapter. 

It is quite clear that integration with respect to the independent 
variable x must not necessarily be proper—if the domain of the- 
function f (z, y) is an infinite band l={a << r<0,c<y< dG, 
then integration with respect to z, with y fixed, is carried out over: 
a half-line and therefore the corresponding integral over the variable- 
xz is improper. This leads to the concept of parameter-dependent 
improper integrals. In this chapter we shall study the properties of 
such integrals. 

We stress that throughout the chapter we discuss Riemann, and 
not Lebesgue, integrable functions and that all integrals, whether- 
proper or improper, are understood in the Riemann sense. 


9.4. PROPER INTEGRALS DEPENDENT ON A PARAMETER 


9.1.14. The parameter-dependent integral. Suppose in a rectangle: 
Tl={e<c2<b,cxy<d} a function f (z, y) integrable over x 
on the closed interval a <x < b is defined for any fixed y in the- 
closed interval e << y <d. In this case on c< y < d the function 


b 
I()= | t(e, y) de (9.4). 


is defined called the integral dependent on a parameter y. The function 
f (xz, y) may be given on a set of a more general form as well. For 
example, the set D ={a(y)<z<b(y), ex y <d} may serve- 
as the domain of f (z, y). In this case the function of y is defined on. 
[c, d] using relations (9.1) but the limits of integration, a and b, 


266 Fundamentals of Mathematical Analusts 


will depend on y. We shall first study the case where the limits of 
integration are constant. 

9.1.2. Continuity, integrability and differentiability of parameter- 
dependent integrals. The following theorems give the answer to the 
above questions. In these theorems JI will denote the rectangle 
{axrcb,cay< 4d}. 

Theorem 9.1. ff a function f (x, y) is continuous in TY, then the 
function I (y) defined by relation (9.1) is continuous on {e, dl. 

Proof. It follows from formula (9.1) that the increment AJ = 
= I (y + Ay) — J (y) of F (y) equals 


bh 


Al = \ [f(z, y+Ay)—f (z, y)] dz. (9.2) 


a 


Since by the Cantor theorem f (z, y) is uniformly continuous in I], 
given © > 0 we can find 6 > 0 such that for all z in {a, bJ and all y 
and (y + Ay)in [c, d] such that | Ay | << 5 we have { f (xz, y+Ay)— 
— f(x, y)|<e/(b — a). But then it follows from relation (9,2) 
that when {| Ay |<< 6 we have the inequality | AJ |< e which 
implies that J (y) is continuous at each point y of [c, d]. Thus the 
theorem is proved. 

Theorem 9.2. If f (x. y) is continuous in TI, then T (y) is integrable 
-on [c, d]. Aforeover, 


d d by ; a] d 
\rindy=l\[ Vr, naz|ay= | ae | fitz, way. (9.3) 


In other words, under the hypotheses of the theorem @ parameter- 
dependent integral may be integrated with respect to the parameter 
under the integral. 

Proof. By Theorem 9.1 J (y) is continuous on [c, d] and is therefore 
integrable on that interval. The validity of formula (9.3) follows 
from the equality of the iterated integrals occurring in (9.3) (they 


are equal to the double integral (| re, y) dx dy). Thus the 
“i 
theorem is proved 
Remark. [n (9.3) instead of the upper limit dof integration with 
respect to ¥ we may put any number from the closed interval [c, d]. 
Theorem 9.3. If f(x, y) and its partial derivative ot are con- 
tinuous in YI, then I(y) is differentiable on {c, d] and its deriv- 
ative can be found from the formula 


I 
af DEE tp (9.4) 
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In other words, under the hypotheses of the theorem a parameter- 
dependent integral may be differentiated with respect to the parameter 
under the integral. 

Proof. Consider the following auxiliary function: 


b 
N 7 wy 
By) = { eo dz, (9.5) 


a 


Since - is continuous in II, by Theorem 9.14 g (y) is continuous on 
(c, d| and the integral of the function over [c, y] can be found from 
the formula for integration under the integral. According to the 


remark to Theorem 9.2 we get 
y b v , b b 
| ¢@dt= [da | SoM =| te, y) dz— | f(z, c)dz. (9.6) 
¢ a c a a 
b 


b 
since \ f(x, y)de =TJ(y) and \ f (x, c) dx = TI (c), from rela- 


tion (9.6) we obtain the following representation for f (y): 
v 
Liy=\ e@d+r(e). (9.7) 


As is known, the derivative of an integral with a variable upper 
limit of a continuous function g (?) exists and is equal to the value 
of that function at a point y. Therefore the function J (y) is differen- 


tiable and its derivative es is equal to g (y). Turning to formula 


(9.5) for g (y) we see that relation (9.4) is true. Thus the theorem is 
proved. 

9.1.3. The case where the limits of integration depend on a parame- 
ter. We have already said that a case is possible where the limits of 
integration depend on a parameter. We shall assume that the func- 
tion f (z, y) is given in a rectangle II containing a domain D defined 
by the relations {a Wy) <z<b(y), cx y<q) (Fig. 9.4). If for 
any fixed y in {c, d] the function f(z, y) is integrablo over x on 
{2 (y)), {b (y)], then obviously the following function is dofined on 
{c, df: 

o(y) 
I(y)= | f(, yde (9.8) 
a(y) 
which is an integral dependent on the parameter whose limits of 
integration are also parameter-dependent. 
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We shall investigate such integrals for continuity and differentiabil- 
ity with respect to the parameter. The following theorems give the 
answer to the above questions, 

Theorem 9.4. Let f (x, y) be continuous on 11 and let a (u) and 
b (y) be continuous on {c, d\. Then I (y) defined by relation (9.8) is 
continuous on [e, dJ. 

Proof. Choose an arbitrary yy in [c, d] and represent J (y) as 


b(t) b(3) a(y) 
Tiyy= | f(a, ydet | f(x y)de— { f(a, nyar. (9.9) 
ative) b(Yo) G( Yo) 


Since the first integral on the right of (9.9) is an integral dependent 
on the parameter y, with constant limits of integration and constant 


Fig. 9.4 


integrand, it is a continuous function of y and tends therefore to 
I (Yo) aS y — Yo. For the other two integrals we obtain the following 
estimates: 


b(y) 

f(a, y)dr{ SMS (y)—5 (9), 
uno) 
acy) 
| f(z, y) dz] <A a (y)—a2 (up) |, 
a(t) 


where Jf = sup | f (z, y) |. From the last inequalities and from the 
if 


continuity of the functions a (y) and b (y) it follows that as vy — yoy 
the last two integrals on the right of (9.9) tend to zero. Thus the 
limit on the right of (9.9) as y + y, exists and equals J (y,). So 
I (y) is continuous at any point y, of [c, dj, i.e. is continuous on 
[c, d]. Thus the theorem is proved. 

We now prove the theorem on the differentiability of the integral 
I (y) with respect to the parameter. 


Theorem 9.5, Let f (2, y) and its derivative “ be continuous in WY. 
Also let a (y) and b (y) be défferenttabie on {c, dl. Then the function 
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I (y) defined by relation (9.8) is differentiable on (c, d] and its derivative 
I’ (y) is given by the formula 


b(y) 
I’ yy) = \ ge dab’ (y)f(o(y), y)—@’ (y) fF (ay), Y)- (9.10) 
a(y) 


Proof. Choose an arbitrary yo in {c, d] and represent / (y) in 
the form (9.9). The first integral on the right of (9.9) is an 
integral dependent on the parameter y, with constant limits of 


integration. Since under the hypothesis f(z, y) and <u are con- 


tinuous in II, by Theorem 9.3 the first term is a differentiable 
function at a point yo and the derivative of that function at that 


D( to) 
Eto) dx. We prove that the second term 


point is equal to { 
a(tio) 

on the right of (9.9) has a derivative at the point yo. Since 

that second term vanishes at y=Yo, is suffices to show the exis- 

tence of the following limit: 


a 

I (z, y)dz 

lim b(1o) (9.11) 
vr,  I—¥o 


which by definition is just equal to the desired derivative. 
We transform the integral in the numerator of formula (9.11). 
From the mean value formula we have 


o(y) _ 
\ F(a, y)da=F (x, y) (b(y) —8 (yo)), (9.12) 


bo) 


with zx contained between b (y,) and b(y). Substituting the expression 
for the integral from formula (9.12) into the numerator of the ex- 
pression (9.441) and considering that by continuity f(z, y)—~ 
> F'D(Yo)s Yo) as y—> Yo and “= Sed 
0 
that the limit (9.11) of interest to us exists and equals 
b’ (Yo) f (b (Yo), Yo). Reasoning quite similarly we see that the third 
term on the right of (9.9) has also a derivative at the point Yo 
equal to 2’ (¥o) f(@ (Yo), Yo)- 

So we have proved that / (y) is differentiable at an arbitrary point 
Yo of [c, d] and that its derivative J’ (y)) is given by formula (9.10). 
Thus the theorem is proved. 

Remark. Theorems 9.4 and 9.5 are also true in the case where 
f(z, y) is given in a domain D alone and satishes in that domain 
the same requirements as in [I. 


—>b' (yo) as YY We see 
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9.2, IMPROPER INTEGRAL DEPENDENT ON A PARAMETER 


9.2.4. The parameter-dependent improper integral of the} first kind. 
The uniform convergence of a parameter-dependent improper integral. 
The symbol MW. will denote a half-band {a<2<0, c<y<d}h. 

Suppose in IT. a function integrable over z in the improper sense 
on the half-line a < x2 < oo is given, with any y of Ic, ad] fixed. 
Under these conditions the following function is defined on fe, dJ: 


Tiy)= | fle nae (9.18) 


which is called an improper integral of the first kind dependent on the 
parameter y. ‘The integral (9.43) is said to converge on the closed 
interval [c, J. 

In the theory of parameter-dependent improper integrals an 
important role is played by the concept of uniform convergence. 
We shall formulate this notion. 

Definition, An improper integral (9.13) is said to be uniformly con- 
vergent with respect to a parameter y on a closed interval |e, a) if it 
converges on {c, ad) and if given any € > 0 we can find A > a deper- 
dent only on € such that for any R > A and for every y inte, d] 


oo 


| f(z, y) az 


F 


<é. (9.14) 


We formujate the Cauchy criterion for the uniform convergence of 
parameter-dependent improper integrals. 

Theorem 9.6. For an improper integral (9.13) to converge uniformly 
with respect to a parameter y on a closed interval (c, d] it is necessary 
and sufficient that given any ¢ > Owe should be able to find a number 
A >a dependent only on ¢ such that for any R' and R" greater than A 
and for every yin le, d) 


Nn” 


(f(z, y) dz 


Ie’ 


<€. 


The validity of this criterion follows immediately from the definition 
of uniform convergence. 

Yor applications it is appropriate to point out a number of suffi- 
cient tests for the uniform convergence of parameter-dependent 
improper integrals. 

Theorem 9.7 (Weierstrass test). Let a function f (x, y) be defined 
in 1 and integrable over x on any closed interval la, RJ for every y 
in fe, da]. Suppose further that for all the points of Th. 
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Then the convergence) of \ g (x) dx implies the uniform convergence of 


the integral (9.13) with respect to y on Ie, d). 

Proof. By the Cauchy criterion for the convergence of the integral 
of g (x) (see Theorem 3.1) given any ¢ > 0 we can find A > a such. 
that for all R’ > R' DSA 


R” 
\ g(x) dr<e. 
R’ 
Applying inequality (9.15) we get 
R” R? 
\ f(z, y) dx <| g(z)dr<e 
R’ R’ 


for every y in fe, dl]. 

This just means that the Cauchy criterion for the uniform conver- 
gence of the integral (9.13) hoids. 

Corollary. Let a function g(a, y) definedin a half-band Il. be 
bounded in that half-band and integrable over x on any interval [a, RJ. 
for each y € {c, d]. Then if the integral 


co 


\ (h(x) | dx 
converges, then so does uniformly with respect to y on [e, d] the integral 


{ p (x, y) & (x) dz. 
“To prove this it suffices to put in Theorem 9.7 | 
f(t, y)=9 (x, y) h(x), 8 (x)= Mh (z)|, where AT = sup ip (x, y) [e- 


Note that the Weierstrass test is a sufficient test for the uniform 
convergence of parameter-dependent improper integrals which guar- 
antees their absolute convergence. As in the proof of Theorem 3.4 we 
can establish the following sufficient test for uniform convergence- 
that is applicable to conditionally convergent integrals as well. The 
following statement (Abel-Dirichlet test) is true. 

Let a function f (x, y) be defined in a halj-band II ~, be integrable- 
over x on any closed interval la, R) for each y € [c, dj, and satisfy 
with some constant M > 0 the condition 


| f(t, y) at| <M. 
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Also suppose that the function g (x) defined for x > a and monotone 
moninereasing tends to sero as r ~+-t-oo. Then the improper integral 


fe a] 


{ f(x, y) 9 (x) dz 


a 
converges uniformly with respect to y on le, a). 

The next test for uniform convergence relates to integrals of non- 
negative functions. 

Theorem 9.8 (Dini test). Let f (x, y) be a function continuous and 
nonnegative in a half-band Wo. and suppose for each y Elec, d) the 
improper integral 


r(y)= \ f(z, y) dz (9.13) 


‘converges, 
Suppose further that I (y) is continuous on {c, d). Then the integral 
(9.13) converges uniformly with respect to y on [c, d}. 
Proof, Consider the sequonce of functions 
atn 
I, (y) = \ f(x, y) dz, 


a 


‘each by virtue of Theorem 9.41 continuous on [c, d]. Since tho inte- 
grand / (z, y) is nonnegative, 7, {y), being monotone nondecreasing, 
‘converges on [c, d] to the continuous function J (y). Consequently, 
by Theorem 1.5 (Dini test for functional sequences) the sequence /,(y) 
convorges to J (y) uniformly on {c, dj. This means that given any 
#2 > 0 we can find NV such that 


F(y)—ty (y) = | f(z, y)dzr<e 
a+N 
for all y of {c, d] at once. From the nonnegativity of f (z, y) it 
follows that for any R DSN - a and any y € {e, d] 


O< \ f(z, y)dzr<e 


St, a 


This just means that the integral (9.13) is uniformly convergent. 
Thus the theorem is proved. 

9.2.2. Continuity, integrability and differentiability of parameter- 
dependent improper integrals. The following two theorems are trues. 

Theorem 9.9. Let f (x, y) be continuous in IT,, and let the integral 
(9.13) converge’untformly on a closed interval {e, d]. Then that integral 
is a continuous function of y on {e, dl. 


Ch. 9. Integrais Dependent on Parameters 273 


Proof. Consider the sequence of functions 
a+n 
In )= | f(y) dz, 


a 


each by virtue of Theorem 9.1 continuous on [c, d]. Obviously the 
uniform convergence of the integral (9.13) implies the uniform 
convergence to J (y) of the functional sequence J,, (y). In such a case 
the continuity of J (y) follows from Theorem 1.7. 


Theorem 9.10. Let f(z, y) and its partial derivative st be 


continuous in Il~. Suppose further that for some y in [c, ad] 
the integral I(y)= | f(x, y) dx converges and {3 dx converges 


uniformly with respect to y on [c, a]. Under these conditions 
I (y) is differentiable on [c, d] and its derivative I’ {y) can be found 
from the formula 


I? (y) = \ oe de. (9.46) 


a 


In other words, under the hypotheses of the theorem differentiation 
with respect to a parameter can be carried out under the parameter- 
dependent improper integral. 

Proof. Consider the sequence of functions 


a+n 


In(= | fla, y)ae. 


a 


By Theorem 9.3 each of the functions J, (y) is differentiable on 
{c, d) and 


a+n 


’ Of (x, 
I’, (y) = \ TED dz. (9.47) 


From the hypotheses of the theorem it follows that the sequence 
of the integrals on the right of (9.17) converges uniformly on Ic, d). 
Consequently, uniformly converging to the same limit function is 
the sequence of the derivatives J; (y). Applying Theorem 1.9 we 
obtain equation (9.16). 

We prove a theorem on proper integration of the parameter-depen- 
dent improper integral. 

Theorem 9.11. If the hypotheses of Theorem 9.9 hold, then the 
integral (9.18) can be integrated with respect to the parameter y on 


18-—-01684 
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a closed interval |c, dj, with 
d d OG on d 
(rardy= {dy {fiz ydz= [ax | fe uydy. (9.18) 
c C a a € 


In other words, under the hypotheses of the theorem a parameter- 
dependent improper integral can be integrated with respect to the puram- 
eter under the improper integral. 

Proof. Since the hypotheses of Theorem 9.9 hold, the function 
J (y) is continuous on [c, d) and is therefore integrable on [c, d). 
We proceed to prove relations (9.18). 

Using the property of uniform convergence of the integral (9.43) 
we can, for a given e > 0, find A >a such that when 2? > A for 
every y in [e, d] 


fe 7) 


| re, y) dz | <e 
R 


. (9.19) 


Assuming further 7? > A and using the possibility of inverting the 
order of integration for parameter-dependent improper integrals we 
turn to the following obvious equations: 


d c R oo 
{ f(y) dy = Ri f(z, y)dz+ \ f(z, y) dee | dy = 
c c a i 


R d d ro) 
=f de[ Jie nay|+ | allie, ee]. 
a c c KR 


From these and inequality (9.19) it follows that for all 2 > A 


d A d 
| I{y) dy ~ | dz|[ \ f(z ¥) dy|<e 
which means that the improper integral ( dz f(z, y) dy over 


¢ 
d 
the variable x converges and equals the number J (y) dy. Thus 
c 


the theorem is proved. 
Remark. Obviously, in relation (9.18) we may substitute for the 
upper limit d of integration any number of the closed interval [c, d]. 
Corollary. If f(z, y) is continuous and nonnegative in Il. and 
the integral (9.13) is a continuous function on fe, ad), then formula 
(9.18) és true. 
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Indeed, under the requirements formulated all the conditions of 
the Dini test for the uniform convergence of the integral (9.13) hold 
(see Theorem 9.8). The statement of the corollary is thus true. 

We now prove a theorem on improper integration of the parameter- 
dependent improper integral. 

Theorem 9.12. Let f(z, y) be continuous and nonnegative when 
zo aand y>c. Also let the integrals 


oo 


I(y)= { f(z, y)dx and K (x)= \ f(x, y) dy 


be continuous for y Sc and x >a respectively. Then the convergence 
of one of the following two improper integrals 


oo 


| 2may= { ay f(z, y)dz and [x (2) dz= [arf tte y) dy 


c 
implies the convergence of the other and equality of the two integrals. 


Proof. Assume that the integral \ I (y) dy converges. We must 


prove that the integral { K (a) dx converges and equals \ I (y) dy. 


a ec 
In other words, it is necessary to prove that given any e >0 we 
can find 4 >a such that for R>A 


| (7 (y) dy — [ ( da | <e. (9.20) 
0 a 


It follows from the hypotheses of the theorem that given any fixed 
R >a the hypotheses of the corollary of Theorem 9.114 hold for 
f(z, y) in the half-band {a<7r< R, cx<y< oo}. ForanyR >a 
therefore 


f(z, y) dz. 


Q Ce, 
Baty] 


(Kear ax | (2, y) dy= {ay 


Using these equations and the convergence of the integral { I (y) dy 


c 
we transform the difference under the modulus sign in inequality 
18+ 
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(9.2U). For any & greater than c we write the equation 


oc k co Xx os te 
| I (y) dy— K (2) dz= | dy f(z, y)dz— { dy | f(z, yjdr= 


rf "! c nn 


dy | f(x, y)dzx. 
Re 


=| dy | ft, dz \ dy \ fe yde+ 
¢ Rr R 


(9.21) 
We proceed to evaluate the last integrals in relation (9.21). Since 


foe] 
under the hypothesis | I (y) dy converges, given &€ > 0, we can 
¢ 


find R >c such that O0O< ( I (y) dy < e€/2*. We replace J (u) 


Rt 

in these inequalities by its expression as an integral to obtain the 
oo 4 
R 


following inequalities: 0< \ dy Wace y) dz <e/2. From these 


n" 
and from the nonnegativity of f (z, y) we conclude that for a chosen 
R>cand any R >a 


0< | dy | fiz, u)dzr<ele. (9.22) 
A ORR 


We now fix 2 as shown above and use the arbitrariness of the choice 


of R. In the half-bhand {fa<zr<mo,cxy< R) the function 
j (x, y) satisfies all the conditions of the Dini test for the uniform 
convergence of improper integrals (see Theorem 9.8). Given e>f 


therefore we can choose A>> a in sucha way that for any R > A and 


for every y in a closed interval [c, R] we have 0 < { f(z, y)dr< 


e 


i 
<e/2 (r —c)** from which we obtain the following estimate: 


R 
O0<\ dy \ f(z, Mdz<el2. (9,23) 
c OR 
* The left-hand one of these inequalities follows from the nonnegativity of 
J (x, v) When x > a and y > e. 
** The left-hand one of these inequalities follows from the nonnegativity 
of f(r, vy) when > >a@ and y >. 
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Turning to the expression (9.21) and estimates (9.22) and (9.23) of 
the last integrals in (9.21) we sec that given an arbitrary e > 0 we 


can choose A >a in such a way that for any R >A we have in- 
equality (9.20). This completes the proof of the theorem. 

9,2.3. Parameter-dependent improper integrals of the second kind. 
We introduce the concept of parameter-dependent improper integrals 
of the second kind. Let f (z, y) be a function given in a half-open 
rectangle TI] ={a<iz<b,exy<d}. Assum that for any fixed 


y in (ce, d| the improper integral of the second kind f (x, y) dx 
a 
converges. Under these conditions the function 


b 
y= Vf (@, y) de (9.24) 


is defined on [c, d] called an improper integral of the second kind 
dependent on the parameter y. 

In the theory of such integrals an important role is played by the 
concept of uniform convergence. We shall formulate this notion. 

Definition. An improper integral (9.24) is said to be uniformly con- 
vergent with respect to parameter y on a closed interval {c, d) if it 
converges for every y in le, d] and given any € > 0 we can find 6 > 0 
dependent only on e such. that for any @ in the interval O<a<6 
and for every y in Ic, d} 


| f(z, y) dz|<e. 


b—a 


For improper integrals of the second kind it is easy to formulate 
and prove theorems on continuity, integrability and differentiability 
with respect to a parameter. 

Note that using the transformations of the variable x in Sec- 
tion 3.2.2 improper integrals of the second kind dependent of the 
parameter y can be reduced to parameter-dependent improper in- 
tegrals of the first kind. 


9,3. APPLICATION OF THE THEORY OF PARAMETER-DEPENDENT 
INTEGRALS TO THE EVALUATION OF IMPROPER INTEGRALS 


The operations on parameter-dependent improper integrals sub 
stantiated in the preceding section allow computation of various 
improper integrals. 

Consider examples of computing and investigating the properties 
of such integrals. 
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1°. Prove that the integral 
T (a) = jee 2B S10 de (9.25) 


whose integrand at a point x = 0 is by definition equal to unity 
converges uniformly with respect to a on a half-line 0 << a < oo. 
We first obtain some estimates. First note that 


-ax . 
. os c (a sin z-+-cos z) 
—Ax — 
\ e sinzdr= ry or: 


+C=O(a, z)+-C 


Clearly, for @ > 0 and z > 0 the function @ (a, x) (antiderivative 
for the function e-2* sin x) is bounded: 


& 


JD (a, ai<ptt<e, (9.26) 


Evaluate the following integral: 


enex SF gz (R>0). 


D+, 8 


Integrating by parts for any fixed a@ >0O we find 


i -~ox Sinz sin z dz| == -|[ = ae) |” 4 j Oe) dr\< 
R 


< 1b (a 2) a7 1m (a, z)I 


i = dz, 


K 


From this and inequality (9.26) we obtain the following estimate: 
oo co 
-ax Sinz 2 dx A - 
| { ax SA dal <z+2 \ 2 PR: (9.27) 
ii it 
This estimate implies that the integral (9.25) is uniformly conver- 
gent in a on 0 <a < co. Indeed, let e be an arbitrary positive 
number. Choose for that a number A > 0 so that 


_ 
aS 


It is clear that then, with R > 1, by the estimate (9.27) for every 
a> 0 


ay 

, sinz 
| | e-ax 2 dari<ce 
R 


thog 
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which implies uniform convergence in a on the half-line 0 < a < c 
of the integral (9.25) under investigation. 
2°. Use the conclusions just derived to compute the integral* 


[= \ SR? de, (9.28) 
0 


Note in the first place that the integral is the limiting value, as 
a—>0 +0, of the function J (a) defined by relation (9.25). Indeed, 
the integrand in (9.25) is continuous when « > 0 and x > O (when 
x = 0 it is assumed to be equal to unity) and the integral (9.25) 
uniformly converges in a on 0 < «@ < oo. By Theorem 9.9 therefore 
the integral (9.25) is a continuous function of @ on the half-line 
a> 0. It follows that 


oo 


lim J(a)=I= \ Sint ae, (9.29) 
a>0+0 z 


We obtain for the function J («) a special representation using which 
we shall find the value of the limit (9.29). That representation is 
obtained from the expression for the derivative I’ (~). We must first 
therefore see if it is possible to differentiate the integra] (9.25) with 
respect to the parameter « under the integral. To this end we check 
to see if the hypotheses of Theorem 9.13 hold when applied to the 
integral (9.25). It is obvious that the integrand and its partial deriv- 
ative are continuous in @ when a > 0 and z>0. We now turn 
to the question of uniform convergence in @ of the integral 


ew@X sin z ax (9.30) 


ot 8 


of the partial derivative of the integrand in (9.25). Fix any A > 0. 
Since | e-** sinz |< e74* for all a >A and since \ e-4* dz 


0 
converges, by the Weierstrass test (Theorem 9.7) the integral (9.30) 
converges uniformly in « when a > A. Since A is any positive num- 
ber, we can differentiate the integra] (9.25) under the integral with 
respect to the parameter @ for any «> 0. So when a >0 


e~"*X* sin x dz= — 


, 4 
I’ (a)=— Tar 


__ 
* The convergence of the integral under consideration was established in 
Section 3.4.2. 
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Integrating the left- and right-hand sides of the last relations we 
get for « >0 


[(a)=— \ —— —arc tana+C., (9.34) 


We seek the constant C. Since | == 
pression (9.25) we get for a>0- 


I (a)l<f e-s8de=z 


0 
from which it follows that 
lim JJ (a)| =0, 


As 

and therefore 
Jim J (a) =0. (9.32) 
ac 

Since lim arc tana = 2/2, from (9.31) and (9.52) we find that 


a- & 
= 1/2. So for a > 0 the function J (a) may he represented in the 
following form: 


I (a) =~ —are tan a. 


2 
From this and from formula (9.29) we evaluate the integral (9.28): 
[ lg. (9.33) 


a 


Remark. Consider the integral 
=| sin as (9.34) 


We evaluate this integra] for all possible a. 
We make a change of variables in the integral (9.34) for a > 0, 
setting az = y. Then 


K (a)= [ simar gy =|eha 
we make a change of variables fora < 0, setting ac = —y (y > 0). 
ien 
K(a)=—) St y= 2 


0 
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It is obvious that for @ = 0 the integra] (9.34) is equal to zero. So- 


oO n/2 when a>0Q 
K (a) = [A * ac= O when a=0 
0 —n/2 when a< 0. 


The integral we have considered is commonly called a discontinuous 
Dirichlet multiplier. 

Using the discontinuous Dirichlet multiplier we obtain the follow- 
ing analytic representation of the well-known function sgn @ which 
is usually termed the “a sign’™*: 

Sgn a 2 

st 


0 


sin ox 
x 


9.4, EULER INTEGRALS 


In this section we discuss some properties of important nonelemen- 
tary functions called Euler integrals**. 

A Euler integral of the first kind or ‘beta-function’ is the integral 

"1 
Bip, q)= \ P41 (1— 2)! de. (9.35) 
0 

In this integral p and qg are assumed to he parameters. If they 
satisfy the conditions p<(1 and qg<1, then the integral (9.35) 
is an improper integra] dependent on the parameters p and q, the 
singular points of the integra] being the points x = Q and z = 1. 

A Euler integral of the second kind or ‘gamma-function’ is the 
improper integral 

T (p)= \ ew xP! dg. (9.36) 

0 

Note that evaluating the integral (9.36) we must take into considera- 
tion that: (4) integration is over the half-line 0 < x < oo, (2) when 
p< 4 the point z = 0 is a singular point of the integrand (the 
integrand goes into infinity). 

In our discussion we shall take into account the above peculiarities: 
of the functions B (p, qg) and I (p). We shall see below that the 
integrals (9.35) and (9.36) converge for p >0Q and g > 0. 


* This term is due to the fact that the values of sgn « for a > 0, a = 0, 
and « <0 equal 1, 0, —1 respectively. 

** Vor a detailed account of Euler integrals sce: E.G. Whittaker and 
G.N. Watson, “A Course of Modern Analysis”. An introduction to the general 
theory of infinite processes and analytic functions; with an account of the 
principal transcendental functions. 4th ed., Cambridge, 1927. 
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9.4.1. The domain of convergence of Euler integrals. We prove 
that the function B (p, g) is. defined for all positive values of the 
parameters p and g and that TP (p) is defined for all positive values 
of p. 

We first consider B(p, g). For p > 1 and g 1 the integrand 
in relation (9.35) is continuous and the integral on the right of 
(9.35) is therefore proper. Thus B (p, g) is defined for all values of p 
and g mentioned. We now turn to the case where one or both of the 
following inequalities hold: 


O<p<1,0<q<1. (9.37) 


In this case one or both of the points x = 0 and x = 1 are singular 
points of the integrand. With this in mind we represent B (p, q) 
as follows: 


1/2 { 
Bip. d= | xP-1(f — 2x)?! dz -- | xP-l(f—2)*! dzr= 
0 1/2 
=1y(p, a) +12 (p, 9). 


Obviously, either of the integrals J, (p, g) and J. (p, g) has only 
one singular point. 
1/2 


For the integral 7, (p, q) = { x?-1 (4 — x)? dx the singular 


0 
point is the point z = 0. Noting that on the closed interval {0, 1/2] 
the function (1 — x)?! is continuous and is therefore bounded by 
some constant C, it is easy to see that the function Cx?-} is majorant 
for the integrand of J, (p, g). It follows that J, (p, g) converges for 
QO<p<—<i1 and any q. Reasoning similarly it is casy to see that 
I.(p, q) converges for O< gq <i and any p. 

So we have seen that in the case where the inequalities p > 0 
and g > 0 hold the integral (9.35) convorges, i.a. the function B (p, 9) 
is defined for all positive values of p and q. 

We now turn to the function I (p). We have already noted that 
the integral (9.36) has two types of peculiarities: integration over 
a half-line and a singular point z = (). To separate these peculiarities 
we divide the domain of integration into two parts so that there is 
only one of those peculiarities in either. For example, we may repre- 
sent PF (p) as follows: 


I (p) = { en* Ph dz te \ ear dxz=T, (p)+/,(p). 
0 


on ony 3 


Since [eva | <a?! for x > 0, by the partial comparison test 
I, (Pp) converges when p> 0. The integral 7, (p) also converges 
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when p > 0. To show this we can use the partial comparison test 
in the limit form: Jim e-*z’ = U for any r. So we have proved 


x +4 0 
that the domain of TF (p) is the half-Jine p > 0. 

9.4.2. The continuity of Euler integrals. We prove that B (p, q) 
is continuous in the quadrant p > 0, g > 0 and that FV (p) is conti- 
nuous on the half-line p > 0. We first consider B (p, g). To prove 
that it is continuous in the quadrant p > 0, ¢ > 0 it is obviously 
sufficient to show that the integral (9.35) is uniformly converyent 
with respect to the parameters p and g when p> pp > 0 and 
9 == 9% > 1 for any fixed positive values of pg and go. Since py — 
~i{<scp—1, m—-i<q—4, for Uri 


PPV (4 — az) arre- i (f — za), 
! 
From these inequalities and from the convergence of [ar —z)le! dx 


Q 
it follows by virtue of the Weierstrass test that the integral (9.35) 
is uniformly convergent for the indicated values of p and g. Thus 
the continuity of B (py, q) for p> 0 and gq > 0 is proved. 

To prove that P(p) is continuous on the half-line p> 0 it is 
obvionsly sufficient to establish that the integral (9.36) is uniformly 
convergent with respect to p when 0 -< py <p <p, for any fixed 
values of py, and p, satisfying the condition 0 < po < p,. Since 
for the indicated values of p, pg, and p, and for z > 0 


eS Pht < em* [zPent yim fy, 


the convergence of the integral 
a 
[ e7* [xPo-f  zPi~ 3} dx 
rf 


implies, by the Weierstrass test, the uniform convergence of the 
integral (9,36) for the indicated values of p. Thus the continuity of 
P(p) for p > 0 is proved. 

9.4.3. Some properties of the function I’ (p). Hore we shall prove 
that TF (p) has a derivative of any order. We shall also obtain for 
P (p) a formula called a reduction formula. 

Differentiating I (p) with respect to {he parameter under the 
integral we obtain the following integral 


fo] 


{ 2P-te* In ede (9.38) 
0 


which converges uniformly in p on any closed intervalO << pp p< 
<p, Indeed, the absolute value of the integrand in the integral 
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(9.38) satisfies on 0 < rz < co the inequality 
JxP-te* In z[<c e™* JI n x] (xPo-§ = Pi 1), 
Hence the convergence of the integral 


is © 


| e~* |In x] (zPe-! - aPi-!) dx 


0 


implies according to the Weierstrass test the uniform convergence of 
the integral (9.38). This fact, together with the continuity of the 
integrand in the integral (9.38)* when O0<xr<0o, ON<p<o, 
allows us to conclude that it is possible to differentiate F (py) with 
respect to the parameter under the integral. So the derivative T’ (p) 
exists and is equal to the expression (9.38). 

Reasoning similarly it is easy to show that I (p) has a derivative 
of any order and that that derivative can be found by differentiating 
with respect to the parameter p under the integral in the expression 
(9.36) for I (p). 

We now proceed to derive a reduction forniula for TF (p). 

Applying the formula of integration by parts to TF (p -+ 4), with 
p> 0. we get 


P(p+1)= | zPem* dx = {[ ~ xPe~*]> +- p x?-te-* dr= pl (p). 
0 0 
So for any p > 0 
TY (p +1) = pY (p). (9.39) 


Applying successively formula (9.39) for any p> — 1 and any 
natural 2 we get 


Tip +i) =p(p—i)... (p—n+t)!] (p—n 4+ 1). (£1.40) 
Relation (9.40) is called a reduction formula for VY (p). Using (9.40) 
the gamma-{unction for values of the independent variable greater 
than unity is “reduced” to the gamma-function for values of the inde- 
pendent variable between zero and unity. 

Since [ (1) = fe dx = 1, setting in (9.40) p =n wei get 

0} 

Tin+ti)=n(n—1)... 2f1=_n! 
This formula wil] be used below in deriving the so-called Stirling** 
formula giving an asymptotic representation for nl 

The information obtained for the function IT (p) allows us to 
characterize qualitatively the graph of this function. We shall 


* This function is a partial derivative with respect to p of the integrand in 
the expression (9.36) for Pp). | . 
** James Stirling (1692-1770) is a Scottish mathematician, 
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make a geometrical study of the graph of [(p) following in the 
main the pattern presented in Section 9.6 of [1]. 

We have established that the domain of [ (p) is the half-line 
O<cp<oo. The function [ (p) is continuous and differentiable 
any number of times on the half-line, it being possible to find any 
derivative by differentiating the oxpression (9.36) for [ (p) with 
respect to the parameter p under the integral. In particular, the 
second derivative I” (p) is 

CU" (p)= \ xP-t (In 2)? e* dx. 

0 
Since I” (p) => 0, the first derivative I’ (p) may have only one zero. 
Since [ (1) = [ (2)*, by the Rolle theorem that zero of I’ (p) 
exists and is in the interval (1, 2). Since I” (p) > 0, at the point 
where I’ (p) vanishes I (p) has a minimum. Note also that the graph 
of [ (p) is convex down. The graph of [ (p) has a vertical asymp- 
tote at the point p = 0. Indeed, since ['(1) = 141 and [(p)= 
= [(p + 1)/p, the continuity of T (p) at the point 1 implies that 
Tr o) 60 as p +0 + 0. Obviously, [ (p)—> +090 as p + +o. 
Note without proof that the graph of I’(p) has no_ inclined 
asymptotes. 

9.4.4. Some properties of the function B (p,q). Here we shall 
establish the symmetry properties of B(p, qg) and its reduction 
formula. 

We make a change of variable in the integral (9.35), setting z = 
= {1 —?. On making the necessary computations we shall see that 


B(p. 9) = BQ, P) (9.41) 
which expresses the symmetry property of the function B (p, q). 
_ We establish reduction formulas for B (p, .g). To do this we turn 
to the function B(p, gq + 1), assuming p and gq to he positive. 
Applying integration by parts and the formula 2? = z?-t — 
— g?-l ({ — z) we get 

t 
B(p,q-+4)= | 2? (1—2)tde= . 


0 
{ 
_ zP 1 q p ~ 
= atoatles _ P12)! de= 


Bn, 128 (rs oe 
'  * This follows from relation (9.39). 
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From these relations we obtain the following formula: 


B(p. q+ 1)= 7 B(p. 9). (9.42) 
Quite similarly for p>Oand g>0 
B(p+1,q)= ==> B (p, 9). (9.43) 


Formulas (9.42) and (9.43) are called reduction formulas for the 
function B (p, qg). Successive application of these formulas reduces 
computing B (p, g) for arbitrary positive values of the independent 
variables to computing B (p, qg) for values of the independent 
variables in the half-open square O< p<i1, O<iq<l. 
9.4.5. The relation between Euler integrals. We make a change 
of variable in the integral (9.35), setting z= i/(1 + t). Asa result 
we obtain for B(p, g) the following expression: 


tq} 


B(p, g)= i qe te (9.44) 
Using formula (9.41), along with (9.44) we obtain the following 
expression for B (p, q): 


¢  ¢P- - 
B(p, q)= \ ayer. (9.43) 
0 


We now turn to the expression (9.36) for I (p). Using the substi- 
tution z = ty, t >> 0, we convert that expression to the form 


P(P) \ e~yP-1 dy, (9.46) 
0 


Replacing in this formula p by p + q¢ and t by 1 + ¢ we eget 


Le =] 
Y(p+g¢) | —(14t)¥,,P49~$ 
pap = | eS Ye a. 


We multiply both sides of the last equation by 7”! and integrate 
with respect to ¢ between 0 and oo. Obviously, according to relation 
(9.45) we obtain the formula 


P (p-+q)B(p, q)= | at { yP-yP tee ay, (9.47) 
an) 


If on the right of relation (9.47) we may interchange the order of 
integration with respect to ¢ and y, then, takingfinto account (9.46), 
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we get 


oo 


Y(p+q) Bip, d= \ yPtI-leU dy | {Prig-U dt = 
0 0 


=f prorten FO ay 7 (p) | ye dy =P (p) F(a), 
0 0 


i.e. prove the validity of the formula 


T (p)-P 
B(p, 9) =o. (9.48). 


We shall now show that it is possible to change the order of integra- 
tion on the right of (9.47). To do this it is necessary to verify that. 
the hypotheses of Theorem 9.12 hold. First let p > 1 and gq > 1. 
Then obviously the hypotheses of Theorem 9.12 hold. Indeed, 
(1) the function f (¢, y) = t?JyP?t?-te-G+hY is nonnegative and 
continuous in the quadrant t>0, y > 0. 
(2) The integral | f(t, y) dy = 1? 
0 
is a continuous function of ¢ for ¢>0. 


yPt9-1e- (t+ Ou dy = 


ot B 


_ T'(p-q) tP7} 
~ (A-+ 2)Pta 


(3) The integral { f(t, y) dt = yP*9-fe-¥ { tP-te-'V dy = T (p)ytteV 
0 0 
is a continuous function of y for y>0. 


(4) The convergence of \ dy f(t, y)dtis established by straight- 
0 8600 
forward computation. 
So for p > 1 and g > 1 formula (9.48) is true. But if only the 
conditions p >0 and g>0 hold, then by that which has been. 
proved 


Yr 4)T 4 
B(p-+4, ¢+1) =e Pe, 


Using the reduction formulas for B (p, q) and [ (p) we again obtain 
from this (9.48). 

9.4.6. Evaluation of definite integrals by means of Euler integrals. 
Kuler integrals are well-studied nonelementary functions. A problem 
is considered to be solved if it can be reduced to computing Euler 
integrals. 

Here are some examples offcomputing ordinary and improper 
integrals by reducing them to@EKuler integrals. 
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1. Compute the integral 
T= \ of +z)? dz. 
" 


Turning to formulas (9.44) and (9.48) we obviously get 
_p(S 38)_TOMPGQM _ tp¢ typ 33 
=B (4. $)= Penge = t1r(4)r(2) 


4? 4 T' (2) 4 f 4 
2. Compute the integral 
1/2 
I= { sin?-! pcos?! @ dq. 
0 
Setting x = sin*@ we get 
1 op 7 
—-i1 =a 
Tat {22 (1—z)?'de= 
0 
Pp q 
==B (4 +) 1 T(F)r(s) 
2 2°’ 2) 2 (242) . 
rT {-_ 
2 
3. Consider the integral 
x/2 
Tp = | sin?-! ~ dy. 
0 


Using the result obtained in Example 2 (it is necessary to set g = 1) 
~ve find 


. tity T(4) 
| sin?-!pdg = — r (+)—h- (9.49) 
r (2) 


We further have 


oo 
r(+)= | e-* 27. 2 [ en (VE gz, 
. aes . 
0 0 
to 


Setting Vz = ¢ and noticing that | e“" dt equals 1/2 [ e-! dt we 


0 -© 
get, according to the example considered in Section 3.4.2 (Poisson 
integral), 


r()=V5 
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Formula (9.49) therefore becomes 
P 
e Va ( 2 


| sin?! pdg="Z ptt) ; (9.50) 
0 


.—ee 


[p4i= 


no 


9.5. THE STIRLING FORMULA 


The term “Stirling formula” is applied to the following asymptotic 


formula: 
nl = V dnnn"e™ (1 + Gn); 


oro 
vie cn rove here a more general formula describing as pre- 


cisely as desired the bebaviour of the Euler gamma-function for large 
values of the independent variable: 


(9.54) 


PaA+ti= | the dt. (9.52) 
0 


To do this we use the so-called Laplace method based on the fol- 


lowing statement. a 
Lemma. Let f (t) be @ function integrable for some a> 0 on a closed 


interval [—a, a) and representable as 


2n—1 
f(t)= >) ¢,t*+ 0 (t*"). (9.53) 


hk=0 


Then the following asymptotic formula holds: 


4 
l — Tims), od 
| e-Alt (t) dt= > sop Mt) 2 (9.54) 
~a m=0 ee wal 7 


Proof. Substitute relation (9.53) in the integral on the left of 
(9.54) and take into account the fact that the integrals corresponding 
to odd powers of ¢ vanish. To evaluate the remaining integrals it 
suffices to show that for m >0 


4 
a YT { m+—~ 
\ {2M p—Att Pe — = *{nt+z) -+-O (ea) , (9.55) 
0 mt> 


19—91684 
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Represent the integral on the left of (9.55) in the following form: 


oo 


a co 
{ {2Men~At* dt = \ 2™e-h dt — \ {2Me-)" dt, (9.56) 
0 0 a 


Make the substitution x = At? in the first integral on the right 
of (9.56) to get 


\ #2M——M* Ct (9.57) 
0 


Further note that for A>1 and t>a 
en? = (2—1)a%e-1", 


Applying this inequality, evaluate the second integral on the 
right of (9.56) 


os oo 
{ en M22 p< p- (1)? ) He -8 dp = ce 2a?, (9.58) 
a 


a 


Equations (9.56), (9.57) and the estimate (9.58) yield the required 
formula (9.55), which proves the Jemma. 

To apply the lemma make the substitution ¢ = 2% (1 +z) in 
the integral (9.52). As a result this becomes 


oo 


Tat) =n tte | e-Mx-In1t2)} dz, (9.59) 
~4 
Denote by g(z) the following function defined on the half{-line 
x> —t: 
g(x) =senz-Vz2—n(1+2). (9.60) 
Then equation (9.59) may he rewritten in the form 


Pat A) =n tte | e-h0l2) dr, (9.64) 
4 
Our aim is to study the asymptotic behaviour of the following 
integral as 4 —»-+oo: 
T(a)= | en) dz, (9.62) 


—{ 
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To do this we consider in more detail the function g (x) defined 
by equation (9.60). Since 


d 
 o°(2) = (2—In (1+ a))=7a>, (9.63) 


the function g® (x) is strictly decreasing when —1<2< 0 and 
strictly increasing when z > 0. It follows that g (x) is strictly 
increasing on the half-line « >> —4, its range of values being the 
entire number line. Further, since g® (x) has in the neighbourhood 
of the point z = 0 the expansion 


#(2)=2—In (1-+2)=2—(2-4 +40 (2) =F +0 (24), 


there is a function h (x) strictly positive for z > —1 such that 

g? (x) = 2* h (2). 

The function hk (xz) is infinitely differentiable for z > —1, conse- 
quently so is the function g (z) = x Vh (2). 

Considering the foregoing one may say that for the function 
y = g (x) defined by equation (9.60) there is an inverse function 

= g"(y) strictly increasing and infinitely differentiable on the 
entire number line and satisfying the condition g7! (0) = 0. 

Denote that inverse function by z = g (y). Using the above pro- 
perties of the function find the asymptotics of the integral] (9.62). 
The following theorem is true. 

Theorem 9.13. Let x = op (y) be the inverse of the function y = g(z) 
defined by equation (9.60). Then for the integral (9.62) the following 
asymptotic formula is true with any n: . 


n=1 C (m+5-) 
_ p(i2m+1) (0) D) O (4) 
I (y= > Gm ek (9.64) 
mu= y 2 


Proof. Fix an arbitrary positive number a and set b = @ (—a), 
Cc = (a). This means that a = g (c) = —g (b) and therefore —14 < 
<b<OQandc>0. 

Evaluate the following two integrals 


b co 
I, (x)= | e~Ae=) dx, In (A) = \ e-hel) der. (9.65) 


=| Cc 


To evaluate the first integral note that for —1 < xz < b we have 
g (xz) << —a, i.e. g* (x) > a?, and therefore 


e- Ag?(x) <er7 Ac? | 


19% 
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a 
In such a case 


1, ()<ev | dr=(1—|b])en*™. (9.66) 


t 
————n °C 


The ev aluation of J, (4) is similar. For z > ec we have g (x) >a, 
i.e. gr (z) > a@. Therefore, for 3X >1 and z>e 


erm BSX) xe OW (A= VEZ) - BX) a e-(A~ Late —87(x), 


Trom this we get 
Ta (h) < e7 Aha [e ~El™) dr=ecye7*™, (9.67) 


From the estimates (9.66) and (9.67) satisfied by the integrals 
(9.65) we obtain for the integral (9.62) the following relation: 


ie) 


TQ) = \ e-P8t2) de + O (e-28"), (9.68) 
b 


Make a change of variable t = g (2) in the integral] (9.68), i.c. 2 = 
= p(t). As a result we get 


I(x) = | e-*Mt@! (t) dt +O (e-*"). (9.69) 


Since the function q’ (t) is infinitely differentiable, use the Maclau- 
rin formula to represent q’ (¢) as 


2n—1 
hd (0) . 
(p' (t)= >) et +O (1%). 


h=0 


To obtain formula (9.64) it remains to apply the Iemma to the 
function f (t) = m’ (¢). Thus Theorem 9.413 is proved. 

In conclusion we describe the following simple way of computing 
pp) (0). From (9.63) we get 


~ p (t) 


—— 


Qo.go = = 
Co St @(+i * 


This implies the relation 


{+9() _ 95, 14+ 9(2) 
Oia TO 
We thus obtain the following equation: 


p(t) p’ (t) = 2t + 2tep (2). (9.70) 


; 4 
@ (t) = or = 28 
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Differentiating successively (9.70) and setting ¢ = 0 we deter- 
mine all gl) (QO). As an illustration, wo evaluate the first three deri- 
vatives of @ (t) at zero. 

On differentiating (9.70) we got 

[p’ (A) +p) pf) = 2 + 2 (tp (1) + &@ CY). . (9.74) 

Set ¢ = 0 and consider that m (0) = 0. Then p* (0) = 2, ive. 
g’ (0) = V2. 

After differentiating equation (9.74) we get 

3p’ sp" + peg” = 2 te" + 29"). 

On equating t to zero we get 3 V2” (0) = 42, ie. @” (0) = 
= 4/3. Similarly from 

Bap"? Ap! fe pe TY = 2 (1p" + 3G") 
wo got g” (0) = 2/3. 

Consequently formula (9.64) may he written as 


ya PUM), V2 132) 94) 9.72 
M(\=\2 V2. Ea VE (S74) 
Substitute equation (9.72) in (9.61) and consider that I (1/2) = 

= Wx, (3/2) = 4/28 (4/2) = Vanl2. As a result we get 


PA+A)= Vaan e-*(4 + + SO), (9.73) 


Write out the first five torms of the asymptotic expansion of the 
Euler gamma-function: 

’ — 1/5 ee (0 Ta 

PA i= Vana ve (4 L ay s Sag78 

___ 139 _ Sy 4 O (i) } 

51 8402. 2 488 32044 2° ° 

Note without proof that the remainder of an asymptotic serics 

does not exceed the last term retained. 


9.6. PARAMETER-DEPENDENT MULTIPLE INTEGRALS 


9.6.4. Parameter-dependent proper multiple integrals. Let w= 
= (%}, Ley - +s) Sm) be an arbitrary point in a domain D of an 
m-dimensional Euclidean space £™ and let y = (yy, Yor + «a Ya) 
be a point in a domain Q of a space £'. Denoto by D *K Qa subsot 
of an (1 + m)-dimensional Iuclidenn space consisting of all points 
Z == (2, Zo, - - oy Smt) SuCh that the point (z,, 2., ..., 2m) is in 
D and the point (2m43, 2m+eor - ++) Zm4z) iS in Q. We shall often 
use the notation z = (zr, y)€ D X Q. The closure of a domain D 
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will be designated D.\ is easy to see that the closure of D & Q 


— 


coincides with D X Q, 
Let 7 (z, y) be a function defined in D X Q, the function f (2, yp) 
being integrable over 2 for any yp €Q in D. Then the function 


Iu) = \ f(z, y)de (9.74) 


D 


defined in Q will be called the integral dependent on the parameter y. 
Notice that the parameter y is an /-dimensional vector and the inte- 
gral (9.74) is therefore dependent on / numerical parameters y,, Yo) » 

ory Ye 

In close analogy with Theorems 9.9 to 9.42 the following theorems 
are proved. 

Theorem 9.14 (on continuity of the integral (9.74) in the para- 
meter). If f (x, y)is continuous in a collection of independent variables 
in a closed domain D X Q, then the integral (9.74) is a continuous 


function of the parameter y in the domain Q, 

Theorem 9.15 (on integration of the integral (9.74) with respect 
to the parameter). If f (x, y) is continuous in acollection of indepen- 
dent variables in a closed domain D X Q, then the function (9. 74) 
can be integrated with respeci to the parameter under the integral, i.c. 


\ I (y) dy = | ae | f (x, ) dy. 


re D 


Theorem 9.16 (on differentiability of the integral (9.74) 
with respect to the parameter). If f(x, y) and its partial deri- 
af 


vative By, Are continuous in DXQ, then the integral (9.74) has 


: . . . 7] . . 
a continuous partial fee a in Q, with 
_ 5 Sa 
Sr ~~ OUn 


9.6.2. Parameter-dependent improper multiple integrals. Tho con- 
cept of parameter-dependent improper multiple integral could he 
introduced, as in the preceding subsection, for the case where the 
interrand j (c, y) is defined in D X Q, where D < E™ and Qc EF, 
Of the sroatest interest, however, is the caso D = 2 10 he studied 
here. We shall also assume that f(z, y) = F (x, y) g (z), where 
F(x, y) is continuous for s=4£y in D X D and g (z) is bounded 
in D. Thus we are considering integrals of the form 


Viy)= { F(z, u) g(x) dz (9.75) 
D 
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where the integrand may have singularities for z = y only. We shall 
be concerned with continuity of integrals of the form (9.75) in the 
parameter y. In this connection we introduce the following definition 
of uniform convergence of the integral (9.75) at a point. We shall use 
K (yy, 6) to denote a ball of radius 6 with centre at a point yp. 

Definition. An integral (9.75) is said to be convergent uniformly 
in the parameter y at a point y, € D if given any e=> 0 we can find 
6>0 such that K (yo, 6) < D and for any cubable domain oc 
< K (yo, 8) and each point y € K (Yo, 9) 


| Fiz, ybge(z)dxl<e. 


Theorem 9.17. If theintegral (9.75) converges uniformly in y at 
a point yo € D, then it is continuous at the point Yo. 

Proof. We want to prove that for any & > 0 there is 6 > 0 such 
that [7 (y) — V (yo) | << e when | y — yo | < 6. From the definition 
of uniform convergence at a point it follows that there is 5, > 0 
such that K (yo, 5,) cq D and that for y € K (yo, §;) 


|| F(x, ve (a) de | <e/3. (9.76) 
K 


Set 


Vi | F(x, v) eta) az, 


J 


(9.77) 
Ve (y) = | F (x, y) & (x) dz. 
D/K 
From inequality (9.76) it follows that for |y — yp | < 8, 
IVs (| <e/s. (9.78) 


Further note that for c€D\ K (yo, 6,) and y € K (Yo, 83/2) 
the function F (z, y) is uniformly continuous in a collection of inde- 
pendent variables. Hence there is a positive number 6 < 6,/2 such 
that for Jy —y |< 4 


IF (x, yo) — F (x, y) | <e/3af | D |, 


where Jf is a constant bounding the function g and | D |is the volume 
of a domain D. In such a caso for |y — yy | <6 


Vey—Vevi<@ | [FF (2, y)—F (a, yl de<el3. 
D\ Kyo, 51) 
(9.79) 
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From relations (9.77) to (9.79) it follows that for] y—u,|< 4 
LV (vy) — Vo) LUV t+ 1 a (Yo) | + IVs (Cy) — Ve (Yo) L<e. 


Thus the theorem is proved. 

We show one sufficient condition for the uniform convergence 
of an integral at a point that is most commonly occurring in appli- 
cations. _ _ 

Theorem 9.18. Let F (x, y) be a function continuous in D X D 
for x =y and let g (x) be a function uniformly bounded in D. Sup- 
pose that there are constants 2}, O<A <0 m, and c>0 such that 
for all r€D, yED 

[F(z, y|<ela—y]* (9.80) 


Then the integral (9.75) converges uniformly in y at each pointy ,€D. 

Proof. Let yo be an arbitrary point of D. We want to prove that 
given any € > 0 wecan find 6 > 0 such that for any cubable domain 
aoc K (yo, 5) and each y € K (yo, 9) 


| ( F (x, y) g(x) dx | <v. (9.81) 


Using (9.80) and the condition for the boundedness of g (a) we get 
| \ F(x, y) & (2) de| < Cy j jz—y|~" de. 


Fix a pointy € K (yo, 5) and note that the condition w Cc K (yp, §) 
implies the inclusion w c K (y, 28). Hence 
}\ Fix, we(aaz|<e, J |2—y|7*ee. (9.82) 
0 K(y, 26) 


Transforming in the integral on the right of (9.82) to spherical 
coordinates with contre at a point y (see Section 2.5.3°) we get 


26 
| | F(z, y) & (2) dz |< Cy | rm—1~2 dp = A =¢,67~*. 
o 0 


It, follows that on choosing 6 to be sufficiently small we obtain 
inequality (9.84). Thus the theorem is proved. 

9.6.3. Application to Newton potential theory. Suppose a mass my 
is placed at some point P, (x, y, 2). By the law of gravitation 
a mass m placed at a point AT (E, 4, €) is acted on by a force 

yo MMg ~— 

B= — aE Ts 
where R=p (Po, M) = V(z—tP + (y— nF +E—O y is 
the gravitational constant, and r= R/R is a unit vector whose 
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direction coincides with that of PyM. Assuming y = 1 and the 
mass m = 1 we obtain the gravitational force 


Fo ~ r 
Note that the components of this force are of the form 
X= —i(E—2), 
Y=—3r (1-9); 
Z= —Fy (b—2). 


It is obvious that the potential of gravitation defined as a scalar 
function wu such that # = grad u equals 
— Mo 

U=—. 
If the mass is concentrated not at a point Py (x, y, z) but is distri- 
buted over a domain D with density p (x, y, z), then for the poten- 
tial of gravitation and for the components of gravitation we obtain 
the following expression: 


un, = | | { PE b 2) de dy de, (9.83) 


mH LC yo ce z) (E— x) dx dy dz, 


8 


jf p+) | 
| 

=—{[ [2G 29 (9 y)dedy dz, | (9.84) 
J 


an Pw UD (C2) de dy de. 


It is not hard to show that the integrals (9.84) are partial derivat- 
ives of the potential (9.83). Since the integrands in the integrals 
(9.83) and (9.84) are majorized by the function C/R*, where * = 4 
for the integral (9.83) and 7 = 2 for the integrals<(9. '84), by Theo- 
rem 9.18 the integrals converge uniformly at every point M (&, 1, ©). 
Hence, by Theorem 9.17 they are continuous functions of the point 


M (E, 1, &). 


CHAPTER 10 


FOURIER SERIES AND 
FOURIER INTEGRAL 


It is known from linear algebra that if we choose some basis in 
a linear space of finite dimension, then any element of that space can 
be expanded with respect to that basis (uniquely). 

Far more complicated is the question of the choice of basis and 
expansion with respect to it for the case of an infinite-dimensional 
space. 

In the present chapter this question is studied for the case of tho 
so-called Euclidean infinite-dimensional spaces and for basos of spe- 
cial form (the so-called orthonormal bases). 

Particular attention is given to the study of the basis formed by 
the so-called trigonometric system in the space of all piecewise con- 
tinuous functions. 

A generalization of the idea of expansion of a function with respect 
to a basis is the expansion to be studied in the present chapter of 
a function into what is called a Fourier* integral. 

Throughout this chapter the integra] is understood in the Riemann 
sonse. 


10.4. ORTHONORMAL SYSTEMS AND THE GENERAL 
FOURIER SERIES 


In the present section we shall consider an arbitrary Euclidean 
space of infinite dimension*’ . For convenience wo give the definition 
of a ISuclidean space 

Definition I. A linear space f is said to be Euclidean if the 
following two requirements are met: 

(4) arule is known by which any two elements f and g of R are as- 
signed a number called a scalar product of those elements and designated 

» &)s 
v 5 the rule satisfies the following axioms: 
1°. (f, g) = (2, f) (commutative property). 


* Joseph Fourier (1768-1830) is a French mathematician. 
** A Jinear space is said to be tnfintte-dimenstonal if there is any preuccigned 
aumber af linearly independent elements in it. 
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2. (f + g, hk) = Ff, h) + (g, bh) (distributivity). 

3°. (Af, g) = ACh, g) for any real number kh. 

Lf, fp >O if Fe O*, (f, f) =O if f=. 

A classical example of an infinite-dimensional Euclidean space 
is the space of all functions piecewise continuous on some closed inter- 
vala<r<sb. 

Let us agree throughout this chapter to mean by a function f (z) 
piecewise continuous on a closed interval {a, b] a function such that 
is continuous everywhere on [a, bj, except possibly a finite number 
of points z; (i = 1, 2, ..., m) at which it has a discontinuity of 
the first kind, and satishes at every discontinuity point x; the condi- 
tion 

f (x)= Leer (10.4) 


Thus throughout this chapter we require that a piecewise continuous 
function f (x) should satisfy condition (10.4) at every discontinuity 
point x;, i.e. should equal a half-sum of the right~ and left-hand li- 
mniting values. Note that a condition of the type (10.1) is automat- 
ically true at each continuity point of f (z). 

A scalar product of any two elements f (x) and g (x) of the space 
of all functions piecewise continuous on a < x < b will be defined 
as follows: 


b 
(f, a= J f(z) g (2) de. (10.2) 


There is no doubt that the integral (10.2) of a product of two piece- 
wise continuous functions exists. It is easy to verify that Axioms 4° to 
4° are valid for the scalar product (10.2). Axiom 1° is obvious. Axioms 
2° and 3° follow from the linear properties of the integral. 

We prove the validity of Axiom 4°. Since it is obvious that always 


i, f= | f° (xz) dx >O0 it suffices to establish that from (7, f) = 


a 
b 


= \ f° (x) dx = 0 it follows that f (x) = 0, i.e. is the zero element 


a 
of the space under study. Since f (x) is piecewise continuous on 
fa, b], this interval falls into a finite number of subintervals 
[z;_,, 2;) on each of which f (z) is continuous**. 


* 0 stands for the zero element of a linear space. 
** The values of f (x) at the end points z;_, and x; of every interval [x;_1, z;] 
are sct equal to the limiting values f (z;., + 0) and f (x, — 0) respectively. 


300 Fundamentals of Mathematical Analysts 


From \ f° (z) dz = O it follows that for every subinterval [z;_,, 7,] 
{00 " 
a 
| #(x)dz=0. (10.3) 
Xt 


But from (10.3) and from the continuity of f (z) on {z,_,, z;] it 
follows that 7 (z)=0 on [z;_,, z,]*. 

Since the last equation relates to every subinterval and relation 
(10.1) is true at discontinuity points, f (z}) = 0 throughout [a, 0). 
The validity of Axiom 4° is established. 

This proves that the space of all functions piecewise continuous 
on {a, b}] is a Euclidean space with scalar product (40.2). 

We establish the following general property of any Euclidean 
space. 

Theorem 10.1. In any Euclidean space, for any two elements f and g 
we have the inequality 


7, gy <<, A-(g, 8) (410.4) 


which is called the Cauchy-Buniakowski inequality. 
Proof. For any real number 7 


(xf — g, of — 2) > 0. 
By virtue of Axioms 1° to 4° the last inequality mav be rewritten as 
n+ (f, h) — 21. (f, g) — (g, &) = 0. 


A necessary and sufficient condition of nonnegativeness of the last 
square trinomial is the nonpositiveness of its discriminant, i.e. 
the inequality 


G, gs? —(f, f)-(g, 2) <0. (10.9) 


This immediately yields (10.4). Thus the theorem is proved. 

Our next task is to introduce in the Euclidean space under study 
the concept of norm of every element. 

But first we recall the definition of a normed linear space. 

Definition 2. A linear space R is said to be normed if the following 
two requirements are met: 

(1) a rule is known by which each element f of R is assigned a real 
number called the norm of the element and designated }\ jf |j; 

(2) the rule satisfies the following azioms: 


* For it bas been proved in Section 10.6 of [4] that if a function is con- 
tinuous, nonnegative and is not identically zero on a given closed interval, then 
the integral of that function over the civen interval is greater than zero. 
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°. Wf ll>O if fA, fll =O if f=. 


2°. PAs I] = FA d-UL7 Il for any element f and any real number i, 
3°. For any two elements f and g we have the inequality 
lftell<iftlt+ lel (10.6) 


called the triangle inequality (or Minkowski inequality). 
Theorem 10.2. Any Euclidean space is a normed space if the norm 
of any element f is defined in it by the equation 


W=VG P- (10.7) 


Proof. It suffices to show that for the norm defined by relation 
(10.7) Axioms 4° to 3° of Definition 2 are true. 

The validity of Axiom 1° follows immediately from Axiom 4° for 
a scalar product. The validity of Axiom 2° also follows almost im- 
mediately from Axioms 4° and 3° for a scalar product. 

It remains to show the validity of Axiom 3°, i.e. of inequality 
(10.6). We shall rely on the Cauchy-Buniakowski inequality (10.4) 
which we rewrite as 


Mf, QI<VE AVE 2) 


Using the last inequality, Axioms 4° to 4° for a scalar product and 
the definition of the norm (10.7) we get 


Wftell=VG+e, f+ 0=VEA+26, Of, Ox 
<Vi,A+2VGN-Ve, ote, 2= 
=ViIVG, A+VE-. O2=VEALEVG O=Uitilt+iell- 


Thus the theorem is proved. 

Remark. There are certainly more ways than one to introducing 
a scalar product (and norm) in every Luclidean space. It is sufficient 
for our purpose in what follaws that there is at least one method 
of introducing a scalar product in the Euclidean space under consid- 
eration. On fixing that method, in what follows we shall always 
define the norm of a Euclidean space under consideration by rela- 
tion (10.7). Thus, in the space of all functions piecewise continuous 
on fa, b] (according to (10.2)) the norm is defined by the equation 


Wil=V | Fees, (10.8) 


a 


and the triangle inequality (10.6) has the form 


B - =———— 
Vi [f (x) +8 (2)Pdr< y \ f? (x) da+ Vi g°(x) dx. (40.9) 
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We now introduce the concept of orthogonal elements of a given 
Euclidean space. 

Definition 3. Two elements of a Euclidean space, f and g, are said 
to be orthogonal if a scalar product (f, g) of these elements is equal 
to zero. 

Consider in an arbitrary infinite-dimensional Euclidean space R 
some sequence of elements 


Wis Wor es ss Way eee (40.10) 


Definition 4. A sequence (10.10) is said to be an orthonormal system 
if the elemenis of that sequence are mutually orthogonal and have a norm 
equal to unity. 

A classical example of an orthonormal system in the space of all 
functions piecewise continuous on —a <2x<a2 is the so-called 
trigonometric system 


4 COs x sinz COS NZ Sin nz 
I —E OT oS OS ee (10.41 
V/ 2x , Van ’ Va! Vax , Vx 


It is easy for the reader to verify that all the functions (40.14) 
are mutually orthogonal (in the sense of a scalar product (10.2) taken 
for a = —x, b = x) and that the norm of each of these functions 
(defined by equation (10.7) for a = —a, b = x) is equal to unity. 

In pure and applied mathematics there occur many and various 
orthonormal (on corresponding sets) systems of functions. 


Here are some examples of such systems. 
{°. Polynomials defined by the equation 
1 ad? [(z?— 4)7]} 


Py ) = Sap aa (n==0, 1, 2, ...) 


are generally called Legendre polynomials. 
It is easy to see that the functions 


yo aY AEE. pai) (n=, 4,2, 20) 


formed with the aid of these polynomials constitute an orthonormal (on —! < 
<i 1) system of functions. 

2°. Polynomials defined by the equations Ty (z) == 1. 7, (z) = 2!-" cos n > 
% (arceos z), With n = 1, 2,..., are called Chebyshev polynomtals. Of all n- 
degree polynomials with coefficients of z™ equal to unity the Chebyshev polv- 
nomial 7, (z) has the smallest absolute maximum on —! <7 < 1. Vt is possible 
to prove that the functions 


| 
Ti ow oem 
4 2. 2. Ty (z) 
See. eee n=1i, 2, eee 
fo (z) Vni/4 = ® Pn (x) Vn-{/f—z? ( ) 


obtained with the aid of Chebyshev polynomials form a system orthonormal on 
—{<ir<l. 
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3°, Of frequent use in probability theory is the so-called Rademacher* system 
Yn (z)=@ (2-2) (n=0, 1, 2, ve -)s 
where  (t) = sgn (Sin 272), 
It can be proved that this system is orthonormal on a closed intervalO < x < 1. 
4°, Of use in a number of investigations in function theory is the so-called 
Haar** system which is orthonorma) on 0 <z < 41. Elements of this system 
qh, (x) are defined for all n = 0,1,...and for all & taking on values of 1, 2,4,..- 
».., 2% They are of the form 


—_ 2k—~2 2k—-1 
V an when “oni <r2< Qnti 9 

a(R) _— 2k—1 2k 

tn = (2) — V2" when ont <2 gnt1 9 


0 at the other points of {0, 4]. 


Every Haar function is a step of the same form as the function Y 2" sgn x 
on a closed interval [—2-("*}), 2-("+1)], For every fixed n increasing k moves that 
step to the right. Everywhere outside the corresponding step every Haar function 
is jdentically zero. 

Let an arbitrary orthonormal system of elements {,,} be given 
in an arbitrary infinite-dimensional Euclidean space R. Consider 


any element f of A. 
Definition 5. The Fourier series of an element f with respect to an 


orthonormal system {ip,} is a series of the form 


aulfaes (10.12) 


where f, are constant numbers called Fourier coefficients of the element 
f and defined by the equations 
fPh=(f We), &=1,2,... 


It is natural to call a finite sum 
n 
S.= 3 tas (10.18) 
the nth partial sum of a Fourier series (10.42). 


Consider along with the nth partial sum (10.13) an arbitrary linear 
combination of the first z elements of an orthonormal system {%p,} 


n 
2: Cathe (10.14) 
with any constant numbers Cy, Cy, ..., Ch- 


We shall show what distinguishes the nth partial sum of a Fourier 
series (10.13) from all the other sums (10.14). 


* Hf, Rademacher (b. 1892) is a German mathematician. 
** Alfred Haar (1885-1933) is a Hungarian mathematician. 
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Let us agree to call |] f — g || @ deviation of g from f (in the norm 
of a given Euclidean space). 

The following main theorem holds. 

Theorem 10.3. Of all the sums of the form (10.14) it is the nth partial 
sum (10.13) of the Fourier series of an element f that has the smallest 
deviation from f in the norm of a given Euclidean space. 

Proof. Taking into account the orthonormality of {\p,} and using 
the axioms of a scalar product we may write 

n nN 7 

| a Cur —f ||? = (> Cr, —f, 2, Cry, —f) = 

= = = 


= S CRCPas Px) 2 ey Crlfs DAG, I= 


n on n nt 

a 2 ~~ n .Y 9 OD 

= SN Ch 2S Cife FEI = 2 (Cate — Sf + IEP 
= {<= ia 


i 


Thus 
I S Cute ZIP = S (C.-T SS fi (10.45) 


The left-hand side of (10.15) involves a squared deviation of the 
sum (10.14) from f (in the norm of a given Euclidean space). It 
follows from the form of the right-hand side of (10.15) that that 
squared deviation is the smallest for C;, = 7, (for the first sum on 
the right of (10.15) vanishes and the other terms on the right of 
(10.15) are independent of C,). Thus the theorem is proved. 

Corollary 1. For an arbitrary element f of a given Euclidean space 
and any orthonormal system {p,} under an arbitrary choice of constants 
C; for any n 


Tv nN 
WAP S fe<il SS Catta— sll (10.16) 
Inequality (10.16) is an immediate consequence of the identity 


(10.45 


Corollary 2. For an arbitrary element f of a given Euclidean space, 
any orthonormal system {,$ and any n 


| = frtn—f P= ie — ey ff. (10.17) 


This is often called the Bessel* identity. 
To prove (10.17) it suffices to set. Cy = fy, in (410.45). 


* Friedrich Wilhelm Besse] (1784-1846) ig a German astronomer and mathe- 
matician. 
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Theorem 10.4. For any element f of a given Euclidean space and 
any orthonormal system \\p,} we have the following inequality: 


2 SNAP (10.18) 


called the Besse] inequality. 
Proof. From the nonnegativeness of the left-hand side of (10.17) 
it follows that for any n 


py} fR<II f I? (10.19) 


But this means that the series of nonnegative terms on the left of 
(10.48) possesses a bounded sequence of partial sums and is therefore 
convergent. Proceeding in inequality (10.19) to the limit as n -» 00 
(see Theorem 3.43 in [1]) we obtain inequality (10.18). Thus the 
theorem is proved. 

As an illustration consider the space of all functions piecewise 
continuous on a closed interval —xn <2< x and, in that space, 
a Fourier series with respect to the trigonometric system (10.11) 
(it is customary to call that series Fourier trigonometric series). 
For any function f (xz) piecewise continuous on —xn <2< 70 that 
Fourier series has the form 


7 1 = coskz , F sinkz 
Ts + >} (Fr, th Fe) (10.20) 


where Fourier coefficients th and Tr are defined by the formulas 


at 


fo= ae | f(a) ae, 
fi, = 7s { f(z) cos ke de, h= ve | # (@) sin ke dz 


(k=4, 2, ...). 


The Bessel inequality true for any function f (z) piecewise conti- 
nuous on —t <2z< 7 has the form 


+> G+ < \ f? (x) dz. (10.21) 
h=1 ~ I 


20—-01684 
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In this case the deviation of f (z) from g (z) in the norm is equal 
to the so-called root-mean-square (or standard) deviation 


Wf-el=\Vo | ¢@—ei@eas. (10.22) 


= st 


In the theory of Fourier trigonometric scries, however, a somewhat 
different notation for both the Fourier series (10. 20) and the Bessel 
inequality (40. 21) is used. It is the Fourier trigonometric scrics 
(10.20) that is usually written as 


3 aa ») (a, cos Kear -L b) sin kz), (10.20') 
k=1 
where 


a ot 
— fo A 
ayo = Jig on 5 f(x) dx, 


_ at 

fr 4 
a, = — — x cos kz dz, 
Van Ft =a ita) (10.23) 
b, = Ph = | f(z) sin ka dx 


i 


(A= 1, 2,...). 
With this notation the Bessel inequality (10.21) becomes 


oo «T 
2 2 4 n ‘ 
4S (ai I< | Pieyde. (10.21) 
ke 4a 


=| - 7 


Remark. It follows from the Besse] inequality (10.21’) that for 
any function f (x) piecewise continuous on —a rin the quan- 
tities a, and 0b, (called Fourier trigonometric coefficients of f (z)) 
tend to zero as |; — oo (by the necessary condition for convergence 
of the series on the Jeft of (410.21’)). 


10.2, CLOSED AND COMPLETE ORTHONORMAL SYSTEMS 


As in the preceding section we sha} consider an arbitrary orthonor 
maj system {y,} in any infinite-dimensional Euclidean space &, 
Definition 1. An orthonormal system {y),) is said to Le closed if for 
any element f of a given Euclidean space R and for any positive number rt 
there is a linear combination (10.14) of a finite numler of elements of 
{u,} such that its deviation from f (in the norm of Jt) is less than e. 
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In other words, a system {1p,} is said to be closed if any element f 
of a given Euclidean space R can be approximated in the norm of 
that space to an unlimited precision by linear combinations of a finite 
number of elements of {p,}. 

Remark 1. We drop the question of whether there are closed ortho- 
normal systems in any Euclidean space. Note that in Chapter 411 
westudy an important subclass of Euclidean spaces, the so-called 
Hilbert spaces, and establish that there are closed orthonormal 
systems in every such space. 

Theorem 10.5. If an orthonormal system {1p,} is closed, then for 
any element f of a given Euclidean space the Bessel inequality (10.18) 
goes over into an exact equation 


3 RANI (10.24) 


called Parseval’s* formula. 

Proof. Fix an arbitrary element f of the Euclidean space under 
consideration and an arbitrary positive number e. Since the system 
{.} is closed, there is m and numbers C,, Cy, ..., C, such that 
the square of the norm on the right of (10.46) is less than e. By 
virtue of (10.146) this means that for an arbitrary e > 0 there is 
n for which 


NiP— 2 fi<e. (10.25) 


For integers greater than that n inequality (10.25) is clearly true, 
for the sum on the left of (10.25) may only increase as n increases. 
So we have proved that for an arbitrary ¢ > 0 there is m beginning 
with which inequality (40.25) is true. 
In conjunction with inequality (10.49) this means that the series 


SR converges to the sum || f ||*. Thus the theorem is proved. 
k= 


Theorem 10.6. If an orthonormal system {p,} is closed, then what- 
ever the element f the Fourier series of that element converges to it in 
the norm of the space under consideration, i.e. 


lim || 3 faba —f |= 0. (10,26) 


Proof. The statement of this theorem follows immediately from 
equation (40.17) and from the preceding theorem. 

Remark 2. In the space of all functions piecewise: continuous 
on a closed interval —n < xz <n convergence in the norm (10.26) 


* M. Parseval (d. 1836) is a French mathematician. 
20* 
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turns into convergence in the mean on that interval (see Section 4.2.3). 
Thus, if the closure of the trigonometric system (10.41) is proved, 
Theorem 10.6 will state that for any function f (x) piecewise contin- 
uous on —n Mae<cn the Fourier trigonometric series of that func- 
tion converges to it on —2 SK 2<ia in the mean. 

Definition 2. An orthonormal system {yp,} is said to be complete 
if besides the zero element there is no other element f of a given Euclidean 
space that would be orthogonal to all the elements 1p), of {1px}. 

In other words, a system {ip,} is said to be complete if any ele- 
ment / orthogonal to all the elements yp), of {p,} is a zero element. 

Theorem 10.7. Any closed orthonormal system {xp,} is complete. 

Proof. Let a system {p,} be closed and let f be any element of 
a given Euclidean space orthogonal to all the olements 1p, of {p,)}. 

Then all Fourier coofficients f, of f with respect to {ip} are zero 
and therefore, by Parseval’s formula (10.24), [jf {{ =O too. The 
Jast equation (by Axiom 4° for the norm) implies that f = 0. Thus 
the theorem is proved. 

Remark 3. We have proved that in an arbitrary Euclidean space 
the closure of an orthonormal system implies its completeness. In 
Chapter 11 an example is given showing that in an arbitrary Eucli- 
dean space the completeness of an orthonormal system docs not 
in general imply its closure. It is also proved there that for a very 
important class of Euclidean spaces, the so-called Hilbert spaces, 
the completeness of an orthonormal system is equivalent to its 
closure. 

Theorem 10.8. For any complete (and clearly for any closed) or- 
thonormal system {1p;,} two distinct elements f and g of a given Euclidean 
space cannot have the same Fourier series. 

Proof. If all Fourier coefficients of f and g coincided, then all 
Fourier coefficients of the difference f — g would be zero, i.e. the 
difference f — g would be orthogonal to all the elements 1p, of a com- 
plete system {ip,}. But this would mean that { — g is a zero clement, 
i.e. that the elements f and g coincide. Thus the theorem is proved. 

This completes our discussion of the general Fourior series with 
respect to an arbitrary orthonormal system in any Euclidean space. 

Out next goal is to make a detailed study of the Fourier serics 
with respect to the trigonometric system (10.41). 


10.3. THE COMPLETENESS OF THE TRIGONOMETRIC SYSTEM. 
COROLLARIES 


10.3.1. Uniform approximation to a continuous function by trigono- 
metric polynomials. We shall establish here the closure (and there- 
fore comploteness) of the trigonometric system (10.11) in the space 
of all functions piecowise continuous on a closed interval —xa Qz< 
< x. But before proving the closure of the trigonometric system 
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we shall establish an important theorem on uniform approxima- 
tion to a continuous function by the so-called trigonometric po- 
lynomials. 

A trigonometric polynomial is an arbitrary linear combination 
of any finite number of elements of the trigonometric system (10.11), 
i.e. an expression of the form 


T (x) =Co+ > (C, coskx+C;, sin kz), 
h=1 


where n is any integer and C;, and C;, (k =1, 2, ..., m) are ar- 
bitrary constant reals. 

Note two quite elementary statements: 

1°. If P (x) is any algebraic polynomial of arbitrary degree n, then 
P (cos z) and P (sin x) are trigonometric polynomials. 

2°. If T (x) is a trigonometric polynomial, then either of the expres- 
sions T (x)-sin x and T (xz)-sin® x is a trigonometric polynomial. 

Both statements follow from the fact that the product of two 
(and therefore of any finite number of) trigonometric functions* 
of x reduces to a linear combination of a finite number of trigono- 
metric functions of independent variables of the type kx (see this 
for yourself). 

In the theory of Fourier trigonometric series an important role 
is played by the concept of periodic function. 

A function f (x) is said to be a periodic function with period T if: 
(1) f (x) ts defined for all real x; (2) for any real x 


fie+T7) =f (2). 


This equation is usually called periodicity condition. Considera- 
tion of periodic functions results from the study of various oscil- 
latory processes. 

Note that all the elements of the trigonometric system (40.44) 
are periodic functions with period 2x. 

The following main theorem is true. 

Theorem 10.9 (Weierstrass). If a function f (x) is continuous on 
a closed interval |—x, x] and satisfies the condition f (—x) = f (x), 
then it can be uniformly approximated on {[—n, x] by trigonometric 
polynomials, i.e. for this function f (x) and for any positive number e 
there is a trigonometric polynomial 7 (z) such that for all z in 
[—zx, m] at once 


lf (z) — T(z) |<e. (10.27) 
Proof. For convenience we carry out the proof in two steps. 


4°, First suppose additionally that f (z) is even, i.e. satisfies the 
condition f (—z) = f (x) for any z in [—n, a]. 


* In this case we mean cosine and sine by trigonometric functions. 


310 Fundamentals of Mathematical Analysts 


By the theorem on the continuity of the composite function y = 
= f(x), where z = arc cost (see Section 4.7 of [1]), the function 
I(t) = f (arc cos #) is a continuous function of the independent va- 
riable ¢ on a closed interval —1 <t<c1. Thereforo by the Weier- 
strass theorem for algebraic polynomials (see Theorem 1.18) given 
any ¢ > 0 we can find an algebraic polynomial P (t) such that 
| f (arc cos t) — P(t) | <e at once for all fin —1 <t< 1. 

On letting tf = cos x we get 


| f(z) — P (cos zr) j<ce (10.28) 


at once forallxinOxr<iua. 

Since both functions f (x) and P (cos z) aro even, inequality (10.28) 
is true for all cx in —n Gx<OQ, too. Thus (410.28) is true for all z 
in —%t <2 <2 and since (by Statement 1° above) P (cos zx) is a 
trigonometric polynomial, this proves the theorem for an even func- 
tion f (z). 

Now note that a function f (x) satisfying the hypotheses of the 
theorem being proved can be, periodically with period 2n, extended 
to the whole infinite straight line —coo <2r< oo, so that the extend- 
ed function will bo continuous at cach point zx of the line. If f (2) 
is extended in this way, then since P (cos z) is also a periodic function 
of period 2x, we find that for aneven function f (x) inequality (10.28) 
is true everywhere on an infinite straight line —00 <x < oo. 

2°. Now let f (z) be an entiroly arbitrary function satisfying tho 
hypotheses of the theorem. We oxtend it periodically with period 
2x to the whole infinite straight line and form with the aid of it the 
following two even functions: 


fy (z)= Let) (40.29) 
fo (x) = LEN) sin Le (10.30) 


By what was proved in 1°, for any e > 0 thero are trigonometric 
polynomials 7, (x) and 7, (x) such thal everywhere on the infinite 
straight line 


lA) — Ty @i<e4, | fe (t) — 2, (x) | <el4, 
and therefore 
lf; (x2) sin? s—7, (x) sin*z| < &/4, 
\f. (x) sin z—~—T, (x) sin z| <ce/4. 
Adding together the last two inequalities, considering that the modu- 
lus or absolute value of the sum of two quantities is not greater than 


the sum of their moduli and taking into consideration equations 
(10.29) and (10.30) we see that everywhere on the infinite straight 
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line 
\f (x) sin?2—T3(x)| << c/2, (40.34) 


where 7, (x) is a trigonometric polynomial equal to 73 (z) = 11 (t)X 
Xsin? x -+ TL, (x) sin z. 

Instead of f (x) in the above discussion we can take a function 
f(x + 1/2)*. In full analogy with (10.31) we shall have that for 
f(z + n/2) there is a trigonometric polynomial 7’, (z) such that 
everywhere on an infinite straight line 


If (x-+-n/2) sin?x—T, (x)| <e/2. (10.32) 


Replacing in (10.32) x by « — x/2 and denoting by 7’, (x) a trigo- 
nometric polynomial of the form 7’, (sr) = T, (c — a/2) we see that 
everywhere on an infinite straight line 


\f (x) cos? r—T, (z)| < /2. (10.33) 


Finally, additing together inequalities (10.31) and (40.33) and 
denoting by 7 (x) a trigonometric polynomial of the form 7 (x) = 
= T, (x) + 7; (z) we see that everywhere on the inhnite straight 
line inequality (40.27) is true. Thus the theorem is proved. 

Remark. Each of the conditions, (1) the continuity of 7 (zx) on 
{—zx, x], (2) the equality of f (—m) and f (x) in value is a necessary 
condition for f (x) to be uniformly approximated on —n SN t@QU 
by trigonometric polynomials. 

In other words, the Weirstrass theorem can be restated as follows: 
for a function f (z) to be approximated uniformly on a closed interval 
[—n, x] by trigonometric polynomials, it is necessary and sufficient 
that f (x) should be continuous on [—x, x] and satisfy the condition 
f(—n) = f (x). 

Sufficiency makes the content of Theorem 10.9. 

We shall prove the necessity. Let there be a sequence of trigono- 
metric polynomials {7,, (x)} converging uniformly on [—zx, x} to 
f (z). Since every function 7, (x) is continuous on [—nx, x], so is 
f(x), by Theorem 1.8. Given any ¢ > 0 we can find a polynomial 
T,, (c) such that | f(z) — 7, (z) |<e/2 for every z in [—x, a]. 
Therefore . 


[f(—a) — Tr (—m) | <ce/2, | f (a) — Ty (a) | <e/2. 


From the last two inequalities and from the equation T, (—x) = 
= I, (x) following from the periodicity (with period 2x) condition 
we conclude that | f (—x) — f (x) |<< ¢ whence f (—x) = f (x) (by 
the arbitrariness of ¢ > 0). 

10.3.2. The proof of the closure of the trigonometric system. Relying 
on the Weierstrass theorem we prove the following main theorem. 


* For this function satisfies the same conditions as the extended jfunction 
f (z) does. 
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Theorem 10.10. The trigonometric system (10.11) is closed*, i.e. 
for any function f (x)= piecewise continuous on a closed interval 
[—a, =] and any positive number e there is a trigonometric poly- 
nomial 7 (x) such that 


Wf@—Tai=V lr@—-Te@tdr<e. (10.34) 


Proof. First of all note that for any function f (z) piecewise con- 
tinuous on [—zx, a] and for any « > 0 there is a function F (zx) con- 
tinuous on [—ax, a], satisfying the condition F (—a) = F (x) and 
such that 


Ni (a—F (zx) f= V/ \ [f{ (z) — F (x)]? de < &/2. (10.35) 


Indeed, it suffices to take the function F (x) to coincide with 
f (x) everywhere except sufficiently small neighbourhoods of the 
discontinuity points of f (z) and ofJthe point z = zx and to take F (z) 
to be a linear function in those neighbourhoods so that F (zx) is con- 
tinuous throughout (—a, a] and satisfies F (—n) = F (x). 

Since a piecewise continuous function and the linear function 
completing it are bounded, choosing those neighbourhoods of the 
discontinuity points of f (x) and of the point z = a to be sufficiently 
small ensures that inequality (10.35) holds. 

By the Weierstrass theorem 10.9, given a function F (xr) we can 
find a trigonometric polynomial T (x) such that for every z in [—a, a] 


[FF (ce) — T (2) |< e/2V On. (40.36) 
From (10.36) we conclude that 


——— 
WF (2)—T (a) = \/ \ [F (2) —T (2)J2dzr<e/2. (10.37) 


— st 


From (10.35) and (10.37) and from the triangle inequality for the 
norms 


Wi@)—-TAL<NI@—-—FeUI+rlFl@—TEe) I 


we obtain inequality (10.34). Thus the theorem is proved. 
Remark 1. From Theorems 10.10 and 10.7 it follows imme- 
diately that ihe trigonometric system (10.14) is complete. From 


this in turn it follows that the system {1/2 sin nz} 


* And therefore (by Theorem 10.7) complete. 
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(t=1,2,...) is complete on the set of all functions piecewise 
continuous on a closed interval [0, x] (or respectively on [—x, QJ). 
Indeed, any function f(x) piecewise continuous on [fQ, 2} and 
orthogonal on [0, zx] to all the elements of the system 


{/ sin nic} turns out to be orthogonal on {—nx, 7] to all 


the elements of the trigonometric system (10.11) after an odd 
extension to [—7, 0]. By the completeness of the system (10.11) 
that function is zero on [—x, nm] and therefore on [0, mn}. Quite 


similarly it can be proved that the system Te VY = cos nz 


(n=1,2,...) is complete on the set of all functions piecewise 
continuous on [0, x) (or respectively on [—17, Q]). 


Remark 2. It can be shown that of the orthonormal systems indicated in 
Section 10.4 those formed using Legendre polynomials, Chebyshev polynomials 
and Haare functions are closed systems and the Rademacher system is not. 


10.3.3. Corollaries. 
Corollary 1. For any function f (x) piecewise continuous on a closed 
interval |—x, nm] Parseval’s formula 


45 (a? + BR) = = \ f? (x) dz (10.38) 
h=={ ~T 


holds (this follows from Theorem 410.5). 

Corollary 2. The Fourier trigonometric series of any function f (x) 
precewise continuous on [—x, m] converges to that function on |—n, x 
in the mean (this follows from Theorem 10.6 and Remark 2 to it). 

Corollary 3. The Fourier trigonometric series of any function f (zx) 
piecewise continuous on [—n, x] can be integrated term by term on 
{—z, x] (this follows from Corollary 2 above and from Theorem 14.14). 

Corollary 4. If two functions f (x) and g (x) piecewise continuous on 
—m, m=] have the same Fourier trigonometric series, then they coincide 
everywhere on [—x, x} (this follows from Theorem 10.8). 

Corollary 5. If the Fourier trigonometric series of a function f (xz) 
piecewise continuous on [—nx, x] converges uniformly on some closed 
interval [a, b] contained in [—n, x] then it is to the function f (x) 
that wt converges on la, bl). 

Proof, Let F (x) be the function to which the Fourier trigonometric 
series of f (z) uniformly converges on la, 4]. We prove that F ‘2) == 
== f(z) throughout [a, b]. Since uniform convergence on [a, 3) 
implies convergence in the mean on it (see Section 1.2.3), the Fourier 
trigonometric series of f (x) converges to F (z) on [a, b] in the mean. 
This means that for an arbitrary e > O,Ahere is n, beginning wi ' 
which the nth partial sum of the Fourie £ , Mometric series S 


CY 
¢ “MRS 
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satishes the inequality 


fe 
WF(@—S, a= |/ | F(@)—S, (a) Per <el2. (10.39) 


On the other hand, by Corollary 2 the sequence S, (z) converges 
to f (z) in the mean throughout [—a, 2] and therefore on [a, }] too, 
i.e. for the fixed arbitrary « > 0 there is rn. beginning with which 


{Ss 


eo 


ISa(—f@ l= | \ 18. —F Pdr <erd. (10.40) 


” 


From (10.39) and (10.40) and from the triangle inequality 


WF) -fOU<IFP@®—-S,@I+1S, @) -f@ tl 


it follows that || F (z) — f (x) ||] <s. From tho last inequality and 
from the arbitrariness of e > 0 it follows that j| F (x) — f (x) || =0 
and from this on the basis of Axiom 1° for the norm we conclude that 
F (x) — f (x) is the sero element of a space of functions piecewise con- 
tinuous on la, b], i.e. a function identically zero on fa, db]. Thus 
Corollary 5 is proved. 

Remark {. Of course, in Corollary 5 the closed interval {a, b] 
may coincide with the entire interval [—a, <a], i.e. from the uniform 
convergence of the Fourier series of f (x) throughout [—a, a] it follows 
that tt is to f (x) that that series converges on that interval. 


Remark 2. Quite similar corotlanes will be true for a Fourier series with 
respect to any other closed orthonormal system in the space of functions piece- 
Wise continuous on an arbitrary interval [«, b] with scalar product (40.2) and 
norm {{0.8). The orthonormal systems associated with Legendre and Chebyshev 
polynomials and the Haare system (see Section 10.1) may serve as examples of 
such systems. 


10.4. THE SIMPLEST CONDITIONS FOR THE UNIFORM CONVERGENCE 
AND TERM-BY-TERM DIFFERENTIATION OF A FOURIER 
TRIGONOMETRIC SERIES 


10.4.1. Introductory remarks. Of great importance in mathemati- 
cal physics and ina number of other branches of mathematics is the 
question of under what conditions the Fourier trigonometric series 
of a function f (x) converges (to that function) at a given point z 
of a closed interval [—a, =). 

It was known as far back as the late NIN century that there are 
functions, continuous on [—a, a] and satisfying the condition 
T(—x) = f (a), whose Pourier trigonometric series diverge at a prea-- 
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signed point of [—n, 2] (or diverge even on an infinite set of points 
of {—x, a] everywhere dense on that interval)*. 

Thus the continuity alone of f (x) on [—x, x] without any addi- 
tional conditions fails to ensure not only the uniform convergence 
of the Fourier trigonometric series of that function but even the 
convergence of the series at a preassigned point of [—n, mz]. 

In this and the next section we shall explore what requirements 
should be added to the continuity of f (x) (or introduced instead of 
the continuity of f (x)) to ensure the convergence of the Fourier trig- 
onometric series of f (x) at a given point as well as its uniform con- 
vergence throughout {—x, x] or some part of it. 

Yet another question arises in the study of the convergence of 
a Fourier trigonometric series: must the Fourier trigonometric series 
of any function 'f (x) piecewise continuous (or even strictly conti- 
nuous) on [—z, x] converge at least at one point of the interval? 

A positive answer to that question was obtained only in 1966. 

It is a consequence of the fundamental theorem, proved in 1966 
by L. Carleson**, that has solved N. N. Luzin’s*** famous problem 
formulated as early as 19414: The Fourier trigonometric series of any 

oa 


function f (x) for which there is an integral \ f? (x) dx understood in the 


—Jt 

weesgue sense converges to that function almost everywhere on |—n, 
Tt FA RR 

From Carleson’s theorem it follows that the Fourier series of any 
function f (x) integrable on [—x, x] in the proper Riemann sense, 
as well as that of any piecewise continuous function f (x), converges 
almost everywhere on [—zx, 2] to f(z) (since there is an integral 
at 


| f* (x) dx in the Riemann sense and therefore in the Lebesgue sense 
for such a function). 

Note that if a function f (x) is integrable on a closed interval 
[—x, x] not in the Riemann sense but only in the Lebesgue sense, 
then the Fourier trigonometric series of that function may fail to 


* The earliest example of such a function was constructed by the French 
mathematician P. du Bois-Reymond in 1876. 
+** Lennart Carleson is a modern Swedish mathematician. A full proof of 
Carleson’s theorem can be found in Selected problems on exceptional sets by Lennart 
Carleson (Russian translation in Matematika, Vol. II, No. 4, 1967, pp. 113-432). 
*** Nikolay Nikolayevich Luzin (1883-1950) is a Soviet mathematician, the 
founder of the modern Moscow school of function theory. The statement of 
Luzin’s problem solved by Carleson and of his other problems can be found in 
N.N. Luzin. Zhe Integral and Trigonomeiric Series, Moscow, Leningrad, Gos- 
tekhizdat (1951). 
+*** For the definition of the integral in the Lebesgue sense and of the con- 
vergence almost everywhere on a given closed interval see Chapter 8. 
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converge at any point of [—x, al. The Sovict mathematician 
A. N. Kolmogorov* was the first to construct in 1923 an example of 
a function f (z) integrable on [—ax, a] in the Lebesgue sense, with 
an everywhere divergent Fourier trigonometric scries. 

{0.4.2. The simplest conditions for absolute and uniform convergence 
of a Fourier trigonometric series. Let us agree to use tho following 
terminology: 

Definition 1. We shall say that a function f (x) has a piecewise con- 
tinuous derivative on a closed interval {a, b] if the derivative f' (z) 
exists and is continuous everywhere on [a, b] except possibly for a finite 
number of points at each of which the function f' (x) has finite right- 
and left-hand limiting values**. 

Definition 2. We shall say that a function f (x) has a piecewise con- 
tinuous derivative of order n > 1 ona closed interval [a, b) if the func- 
tion {(-")) (x) has on that interval a piecewise continuous derivative 
in the sense of Definition 1. 

The following main theorem is true. 

Theorem 10.11. If a function f(z) is continuous on a closed interval 
[—x, x], has on it a piecewise continuous derivative and satisfies the 
condition f(—m) = f(x), then the Fourier trigonometric series of 
f (z) converges to that function uniformly on [—x, 1]. Moreover, the 
series made up of the absolute values of the terms of the Fourier, irigono- 
metric series of f (x) converges uniformly on [—nx, x). 

Proof. 11 suffices to prove that the series made up of the absolute 
values of the terms of the Fourier trigonometric series? of f (z) 


fol >} {|a, cos kx| + [b, sin kz]} (10.41) 


h=l 


converges uniformly on [—a, x], for this will imply both the uni- 
form convergence on {[—zx, x] of the Fourier trigonometric series it- 
self of f (x) and that it is to the function / (x) that that series con- 
verges (by Corollary 5 of Section 10.3.3). 

By virtue of the Weierstrass test (see Theorem 41.4), to prove uni- 
form convergence on [—a, st] of the series (10.44) it suffices to estah- 
lish the convergence of the number series 


oe lant + [al (10.42) 


* A construction of 4.N. Kolmogoroy's example can be found on pages 412 

to 421 of the book: N.N. Bari, Trigonometric Series, Moscow, Fizmatgiz (1964). 

** The function /’ (z) may turn out to be undefined at a finite number o 

points of [a, b). We specify it arbitrarily at those points (for instance, we set It 
equal to the half-sum of the right- and left-hand limiting values). 
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majorizing it. Denote by a, and #; the Fourier trigonometric coef- 
ficients of the function f’ (x) specifying the function arbitrarily in 
a finite number of points at which there is not derivative of 7 (z)*. 
Integrating by parts and considering that f (x) is continuous through- 
out [—a, a] and satisfies the relation f (—x) = f (x) we obtain 
the following relations connecting the Fourier trigonometric coef- 
ficients of the function /’ (x) and those of the function / (z) itself:** 


ws f 
| f(z)sinkzdz=hk-b,, 


ms § 
y= ) f° (2) 008 tre d= ke 
-7 ~s 


wv ot 
B. = = \ f’ (x) sinkzdz= kek \ f (z) coskxdz= —k.ay. 
=~ st ~3t 
Thus 
Jay] + [bn] = TSE 5. HO 


I kk! 


and, to prove the convergence of the series (10.42), it suffices to estab- 
lish the convergence of the series 


a {+ At}. (10.43) 


The convergence of the series (10.43) follows from the elementary 
inequalities*** 


wal <1 (ag + 4), 


~~ 


and from the convergence of the series 
oo co 4 
>) (hk +-BR), >) a> (10.45) 
hol h=! 


* For instance, we may set the function 7’ (x) at those points equal to a half- 
sum of the right~ and left-hand limiting values. 

** When integrating by parts one should divide (—z, a] into a finite number 
of subintervals having no interior points in common on each of which the 
derivative f’ (z) is continuous and, taking an integration by parts formula for 
each of the subintervals, consider that when summing the integrals over all 
subintervals all the substitutions vanish (because of the continuity of f (z) 
‘throughout [—x, 2] and the conditions f (~x) = f (z)). 

1 


*** We proceed from the clementary inequality | a@{-{b| Sy (a? + 6°) 
following from the nonnegativity of (| a|—] bd })*. 
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the first of them converging by virtue of Parseval’s formula for a 
piecewise continuous function 7’ (x) and the second by virtue of the 
Cauchy-Maclaurin integral test (see Section 13.2 of [4]). Thus the 
theorem is proved. 

Remark. If a function f (xz) satisfying the hypotheses of Theo- 
rem 10.11 is periodically (with period 27) extended to the whole 
infinite straight line, then Theorem 10.41 will state convergence of 
a1 Fourier trigonometric series to the function thus extended that is 
uniform throughout the infinite straight line. 

10.4.3. The simplest conditions for term-by-term differentiation 
of a Fourier (rigonometric serics. We shall first prove the following 
lemma on the order of Fourier trigonometric coefficients. 

Lemma 1, Let a function f (x) and all of its derivatives up to some 
order m (m boing a nonnegative integer) be continuous on a closed 
interval |—n, x] and satisfy the conditions 


f(—) =} (a), 
f' (—a)= f(x), 40.46) 


fos) = 


Also let f (x) have on [—n, xn] a piecewise continuous derivative of 
order (m -+- 1). Then the following series converges: 


2 Bland + Lal), (10.47) 


where a, and b,, are the Fourier trigonometric coefficients of f (x). 

Proof. Denote by @, and f;, the Fourier trigonometric coefficients 
of the function /("*) (x), specifying the function arbitrarily in a 
finite number of points at which tho function f (zr) has no derivative 
of order (m -i- 1). Integrating the expressions for a, and fi, (m - 4) 
times by parts and considering the continuity throughout [—a, a] 
of the function f (x) itself and of all its derivatives up to order m 
and taking into account relations (10.46) we establish the following 
relation of the Fourier trigonometric coefficionts of the function 
fe) (x) to the function f (x) itself*: 


lent -F IB | = At! {fay | - 1, fp. 


Thus 
fay pt Joy) ~~ HSL AO 


* When integrating by parts one should divide the closed interval [—=, 2] 
into a finite number of subintervals having no interior points in common on each 
of which /0"*4) (z) ig continuous and consider that summing the integrals over 
all the subintervals makes all substitutions vanish. 
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and the convergence of the series (10.47) follows from the elementary 
inequalities (10.44) and from the convergence of the series (10.45), 
the first of them converging by virtue of Parseval’s formula for a 
piecewise continuous function f("*) (xz) and the second by virtue 
of the Cauchy-Maclaurin test. Thus the lemma is proved. 

An immediate consequence of Lemma 1 is the following theorem. 

Theorem 10.12. Let a function f (x) satisfy the same conditions as 
in Lemma 4, with m1. Then the Fourier trigonometric series of 
f (x) can be differentiated term by term m times on |—n, x). 

Proof. Let s be any of the numbers 1, 2, ..., m. Asa result of an 
s-fold term-by-term differentiation of the Fourier trigonometric 
series of f (x) we obtain the series 


5 ks { a, cos (kx > | +-b, sin ( fe —=)} . (10.48) 
h=1 


Note that for every x in [—a, x] both the original Fourier trig- 
onometric serics and the series (10.48) (with any s = 4, 2, , m) 
can be majorized by the convergent number series (10. 47). “By the 
Weierstrass test (see Theorem 1. 4) both the original Fourier trigono- 
metric series and each of the series (10.48) (with s = 4, 2, ..., m) 
converge uniformly on [—zx, a] and this (by virtue of Theorem 4, 9) 
ensures the possibility of an m-fold term-by-term differentiation of 
the original Fourier series. Thus the theorem is proved. 


10.5. MORE PRECISE CONDITIONS FOR UNIFORM 
CONVERGENCE AND CONDITIONS FOR CONVERGENCE 
AT A GIVEN POINT 


10.5.4. The modulus of continuity of a function. Hélder classes. 
Weshall begin with explaining the concepts characterizing the smooth- 
ness of the functions under study and with defining the classes of 
functions in terms of which we shall formulate the conditions for 
convergence of a Fourier trigonometric series. 

Let a ppnction f (x) be defined and continuous on a closed inter- 
val [a, b 

Definition 1. For every § > 0 the modulus of conlinuily of f (z) 
on la, b] is the supremum of the absolute value of the difference | f (x') — 
— f(x") | on the set of all x’ and x” of [a, bj satisfying the condition 
| x’ __ a" < 5. 

We shall denote the modulus of continuity of f (x) on [a, b] by 

o (5, f). So by definition* 

0 (5, f= sup. Ifle)—f@"- 


|x’ ~x 


x’. x“E(a, b] 


* Recall that the symbol € stands for “belongs to, is in”, so that the notation 
zx’, z” € {a, bj designates that the points z’ and z” are in a closed interval (a, 6]. 
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Jt is immediate from the Cantor theorem (see Theorem 10.2 in 
(4]) that the modulus of continuity w (5, f) of any function f (x) con- 
tinuous on la, b) tends to cera as & —O0*. 

However for an arbitrary function / (z) only continuous on fa, 0] 
nothing in general can be said about the order of its modulus of con- 
tinuity o (0, f) with respect to a small 6. 

We now show that if a function f (x) is differentiable on {a, 6b] and 
if its derivative f’ (x) is bounded on la, b), then the modulus of conti- 
nuity of f (x) on that interval, w (6, f), hasan order w (5, f) = O (6)**. 

Indeed, it follows from the Lagrange*** theorem that for any points 
x’ and x” in [a, b] there is a point E contained between 2’ and 2x" 


such that 
Lf’) —f(z") l= 17 (8) bie’ — 2" |. (10.49) 


Since the derivative /’ (x) is bounded on [a, b] thero is a constant Af 
such that for every z in the interval | f’ (z) |< A and therefore 
|f’ (&) |< Mf. From the last inequality and from (10.49) we con- 
clude that | f (x) — f (v") |< A/6 for every zx’ and 2x” in Ia, BJ 
satisfying |[2’ — 2” |< 6. But this means that o (5, f) < M6, 
ic. w (6, f) = O (8). 

Let « be any real number in the half-open interval O<a <1. 

Definition 2. We shall say that a function f (x) belongs on a closed 
interval [a, b] toa Holder class C® with index a (0 <a <1) if the 
modulus of continuity of f (x) on [a, b] has an order w (6, f) = O (5%). 

To designate that f (z) belongs to a Hélder class C*% on [a, b] one 
usually uses the notation: f (x) € C% fa, b). 

Note at once that if a function f(z) is differontiable on la, 0] 
and its derivative is bounded on that interval, then f (z) a fortiori 
belongs on la, b) to the Hélder class C1**** (this statement follows 
immediately {rom the relation w (5, f) = O (6) proved above). 

Remark. Let f(z) €C%[a, b}]. The supremum of the fraction 


ie on the set of all z’ and 2x” of {a, b] not equal 
hs ol oaedl® 7 

to cach other is called the Hélder constant (or the Holder coef- 
ficient) of f(x) (on [a, b}]). The sum of the Hélder constant of 
f(z) on [a, b] and the supremum |/f(z)| on that interval is 
called the Hélder norm of |f(x){ on f[a, 6] and designated 


7 lKcata. bj‘ 


a 


* or (by the Cantor theorem) given any e>0 we can find 6 > 0 such that 
1 f(z’) =f) |< ce for every z’ and z” in (a, b] satisfying the condition 
‘— x" ~~ Ve 
arr Recall that the symbol @ = O (6) was introduced in Chapters 3 and 4 
of {4] and denotes the existence of a constant Jf such that [ @ { <= Af6. 
wx Soo Theorem 8.12 in [41] . 
#e** The Holder class C? corresponding to the value & = 1 is often called the 


Lipschitz class. 
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Example. The function f(z) =z belongs on a closed inter- 
val ". 4} to the class C*/*, for (f(2)—f(x")|=V2z'—2’x 
Xe SI x’—zx” for any zx’ and 2” in (0, 1] related by 

z 
the condition 2’ => 2” (the Hélder constant that is the supremum 
— 27" 


[> 7" 
on [0, 1] of the fraction —————— being equal to unity and the 


Vz 
VE+V 23 
Hilder norm to two). 

10.5.2. An expression for the partial sum of a Fourier trigonometric 
series. Let f (z) be an arbitrary function piecewise continuous on 
a closed interval [—z, 2]. We shall extend that function periodically 
(with period 2x) to the whole infinite straight line*. Denote by 
S, (t, f) a partial sum of the Fourier trigonometric series of f (zx) 
at a point x equal to 


S,(z, f= 3+ >) (a, cos kx +b, sin kz). (10.50) 
k==1 


Inserting into the right-hand side of (10.50) the values of the Fourier 
coefficients** 


wt 
‘ 
a=— | f(y) dy, 
— It 
ae pig 
= | f(y) cos ky dy, — | f(y) sin ky dy (A= 1, 2,...) 
— JL w= IT 


and taking into account the linear properties of the integral we get 


Sn (x, f) == \ f(y) | =+ »> (cos ky cos kz - sin séy sin hz) | dy = 
=n h=41 


=— f(y) [$43 cos ke(y—2) | dy 
k=1 


—T 


* By the convention adopted as far back as in Section 10.1 a piecewise con- 
tinuous function f (z) must have at every point z a value equa) to a half-sum of 
the right- and left-hand limiting values. For this property to take place and for 
{ (2) to be periodically (with period 2m) extended to the whole infinite straight 

ine we must require that the relation f (x) =f (—x) = (1/2) [f (—-x + 0) > 
+ f(x — 0)] should hold for the extended function. In other words, we shall 
say that a function f (z) defined on an infinite straight line is a periodic eztensi- 
of a function f (z) piecewise continuous on a closed interval [—x, x] if both 
functions coincide on the interval —n < z < x and if the function f (z) de=nes 
on the infinite straight line satisfies the periodicity condition f (x — 2n) = > (27 
and the condition f (m) = 7 (—x) = (1/2) {f (—2 + 0) + f (a — 0)]. 
** See formulas (10.23). . 
iN . 


21—01684 rs 
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for any point z of an infinite straight line. On making in the last inte- 
gral a change of variable y = ¢ + 2 we arrive at the following ex- 
pression 
4 THX { Tt 
S, (2, => \ f(xz+t) | + »Y coskt | dt. (10.54) 
—T—Xx Rw: § 


Note now that since either of the functions f(z-+i) and 


n 
ls+> cos It | is a periodic function of the variable ¢ with 
ho! 
period 22, the entire integrand in (10.51) (denote it briefly by 
F(t)) is a periodic function of f with period 21. Also note that 
integration in (10.51) is over [—aA—z, n—2z] of length equal 
to 20, i.c. equal to the period of the integrand. We use the 
following elementary statement: if F(t) is a periodic function of 
period 2x integrable over any finite closed interval, then all 
integrals of that function over any of the closed intervals of 
length equal to the period 2x are equal to one another, i.e. for 
any x 


oboe xX - ot 
| F(tydat= ( F(yare. (10.52) 
~—T—x =i 
Equation (10.52) allows us to rewrite formula (10.54) as follows: 
ot n 
Sn(2, y= \ #40) [=+> cos it | dt. (10.53) 
~~ 1 i=] 


Compute the sum in the square brackets in (10.53). To do this note 
that for any k and any value of t 


2 sin = cos kt = sin ( k a. =) t—sin (k — 5) t. 


2 2 
* To prove this statement it suffices, using the additive property, to reser 


os Crs 


resent ( F (t) dt as the sum of three integrals 


=—% we T Tex 
{ F(t) dt + \ F(t) dt+ \ I(t) at 
—texX — st ot 


and to note that with the aid of the periodicity condition F (t) = F (t+ 2x) 
and the change of variable ¢ = y — 2x the first of the three integrals js reduced 
to the third taken with the minus sign. Indeed, 


~at = 1 T.H-<x 
F(t)dt= \ F(t 4-22) dt= | F (uj) dt=— F (uv) dy. 


—1-2 -A-x a—x 
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Summing this equation over all & equal to 1, 2, ..., m we get. 


v7 
seane' af ‘ eee 1 2. if 
2sin-—- >! -coskt=sin(x+—)i—sinz. 


2 2 
h=4 

Hence 

2sin-| + 5 cos kt |=sin ( n + =) t 
and therefore - 

sin 
(n+) 
[+ +3 cos ogee (10.54) 


Substituting (10.54) in (10.53) we finally obtain the following ex- 
pression for a partial sum of a Fourier trigonometric series: 


sin (n+ +] t 


: t 
2 SiR ~> 


S, poe. fern di (10.55) 


true at any point z of an infinite straight line. 
Remark. From formula (10.55) and from the fact that all partial 
sunis S, (x, 4) of the function f (x) = 1 are equal to unity* we get 


ee La | (10.56) 


10.5.3. The integral modulus of continuity of a function. Let a fune- 
tion f (x) be integrable (in the sense of an improper Riemann integral) 
on a closed interval [—x, a]. We extend that function periodically 
(with period 2x) to the whole infinite straight line. 

Definition. For any § of the half-open interval O0<05 < 2x the 
integral modulus of continuity of a function f (x) on a closed interval. 

—a, m] is the supremum of the integral 


1 
| F@+u)—F (ae 
-7 
on the set of all numbers u satisfying the condition |u |< 5. 
We shall denote the integral modulus of continuity of a function 
f (x) on a closed interval [—n, x} by I (5, 7). 


* For the quantity ee 12) for the func i = { is equal : 
(10.50), where ag = 2, a, = b, =O, k= ’ 
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So by definition 
at 

1(5. f= sup, lf tuy—s (Olde. 
Shae a 


The following statement is true. 

Lemma 2. Ifa function f (x) is piecewise continuous on a closed in- 
terval {—a, a] and periodically (with period 2x) extended to the whole 
infinite straight line, then the integral modulus of continuity of the 
function on that interval, I (6, f), tends to zero as § —+(, 

Proof. Fix an arbitrary ¢ > 0. According to Theorem 10.10 (on 
the closure of the trigonometric system) for a function f (x) there is 
a trigonometric polynomial 7 (z) such that 


W—7l=V f f()—T (OP dt <e/3V dz, 


and therefore on the basis of the Cauchy-Buniakowski inequality* 


x % X 

{ lf(Q—T mla<V | [f(t)—T (t)}* at \ dt<e/3. (10.57) 

—x — — 

From inequality (10.57) and from the fact that f(t) and 7 (t) are 

periodic functions of period 2x we conclude that for any number u 
at 

( if@-+u)—T+u)| dt <e/3. (10.58) 

7 


Since the absolute value of the sum of three quantities is at most the 
sum of the absolute values of those quantities, for any number u 


( (fitu)—f dl d< \ [f(i+u)—T7 (t+u)| dt+ 
+ [PEt — P(t) dt+ { |Z (1) —f (t)] dt. (10.59) 


Now it remains to note that by virtue of the continuity clatrigono- 
metric polynomial and the Cantor theorem (see Theorem 10.2 in 
[4}) for the fixed ¢ > O there is 6 >0 such that for |u| <6 and 
every t in [—z, a} 

\7T@+u) —T (t) | < clos, 


* Soe inequality (1.33). 
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and therefore 


| IT (it-u)—T (t){ dt<e/3. (10.60) 


Comparing (10.59) with inequalities (10.57), (10.58), and (10.60) 
we get 


f(@t+u)—f@|dt<e (0.64) 


Leas 


for every u for which | u |< 8. Thus the lemma is proved. 


Remark to Lemma 2. It is easy to see that the integral modulus of continutly 
I (8, jf) tends ta sera as § ~ QO not only for any piecetcise continuous function f (z) 
bul also for any f (x) integrable (in the proper Riemann sense) on a closed interval 
(—2. af. To prove this fix an arbitrary © > 0 and note that by virtueofthe 
integrability of f () on —a <t < a there isd, > O such that for any subdivision 
of [—x, 2] into subintervals of length smaller than 5, the difference between the 
upper and lower sums of f ( is less than e/4. Fix some subdivision 7 of [—a.2] 
into subintervals of equal lencth 5 < 59. From the fact that f (?) is a periodic 
function it follows that for any } uv |< 6 and for the fixed subdivision T of 
—t <t<a the difference between the upper and lower sums of the function 
i (t+ u) (with 6 sufficiently small) is at least Jess than e/2. But from this it 
follows that for the fixed subdivision 7 the difference between the upper and 
lower sums of the function {jf (¢ + u) —f (OJ, with any [uj <4, is less than 
el4 + e/2 = 3/, ©. Denote for the fixed subdivision 7 the upper and lower sums 
of [7 (t + u) — J (t)] respectively by § and s and the upper and lower sums of 


the function ] # (¢ + ux) — 7 (f) ] respectively by S and s. Jt has been established 
in Section 10.5 of [{] that for any subdivision the upper and lower sums S and s 


of the function itself and the upper and lower sums S and s of the absolute value 


~, 


of that function are connected by the relation S—s<S—s. Thus. for the 


fixed subdivision 7, S — s < 3e/4. But this means that for the fixed subdivision 


T the difference between any integra] sum of the function | f (?+ u}) —f ()] 
<1 


and the integral \ l[f@+u)—F()] at is less than the number 3e/4, 


ot 
1f we choose in that integral sum all the intermediate points =, in the 
centre of the corresponding subintervals of length 6 and require that the 
number u should satisfy the inequality | uj< 62, then both points &, 
and & -+ u will be in the Ath subinterval and therefore the difference 
t 7 (& —- u) — 7 (Ex) | will not exceed the oscillation 1, — m, of f(t) on 
the £Zth subinterval*. But then the entire integral sum is not greater than the 
sum ~* (1%, — m;,) At, equal to the difference between the upper and lower 


cums of f (t) of a subdivision 7. i.e. is not greater than the number e/4. It follows 


* By 3f, and m,z we denote the supremum and infimum of j (f) on the Ath 
subinterval. 
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at 


that for fu] < 62 the integral [ }7 (tu) —f () |] dt is not greater than 


= st 
the number e, which proves that J (6, f) tends to zero as § ~ 0. 


We now derive from Lemma 2 a number of corollaries impor- 
tant for what follows. 

Corollary 1. If a function f (t) is piecewise continuous on a closed 
interval [—ax, x] and periodically (with period 22) extended to the 
whole infinite straight line and x is any fixed point of {—ax, x], then 
for any & >O there is § > 0 such that 


{ if (etitu)—f(z+ldt<e, (10.62) 
with lu |< 6. 


Proof. On making in the integral on the left of (£0.62) a change of 
variable r=2+ft 


lif(ztt+uy—f(etnia= | yfir+uy—f (rar 
—% a THEN 
and observing that (by virtue of equation (10.52)) 
T4+X ba 
) ertuy—f@lar= | (etm —flar, 
— +x ~1 


we see that inequality (10.62) is a consequence of (10.64). 

Corollary 2. If each of the functions f (t) and g (t) is piecewise con- 
tinuous on |—x, a] and periodically (with period 2x ) extended to the 
whole infinite straight line, then the function 


I(z)= | f(z+ne (nar 


— st 


is a continuous function of x on a closed interval —na <. rn. 
Proof. Let x be any point of [—a, a). Then 
et 


[(r+u)—/ (x)= ( (f(r+t+uy—f(x+t)} g(t) dt. 


— 1 


and since the function g (f) piecewise continuous on [~z, a] sat- 
ishes on that interval the boundedness condition | g (8 |< 1/, we 
have 
st 
[IT (e+uy—I(x)(<a [f(t ttuy—f(e+n fae 


~ 3T 
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and therefore in view of (10.62) for any e >0 
[\l(e+uy—T(e)[<e, iff[u |< 6 (e). 


Thus the continuity of J (xz) at a point zx is proved. 

Corollary 3. If each of the functions f (t) and g (t) is piecewise con- 
tinuous on [—n, x] and periodically (with period 2x) extended to the 
whole infinite straight line, then the Fourter trigonometric coefficients of 
the function F (x, t) = f (x + t) g (t), when this function is expanded 
in terms of t, 


ay (2) = = \ f(x+t) 2(t) cos nt dt, (10.63) 
bn (x)= — | f (x-b4) g(t) sin nt dt (10.64) 


converge to zero (as n ~+ 00) uniformly with respect to x on [—nx, a] 
(and therefore on the whole infinite straight line). 

Proof. For any fixed point x of [—ax, x] the function F (x, t) = 
= f(z + 2) g (t) is a piecewise continuous function of the indepen- 
dent variable ¢ on j|—zx, x] and therefore Parseval'’s formula 


HO) 4D fai (2) LRM = YF fete) 2 (1) at (10.65) 
k=1 —<t 


holds for this function*. Equation (10.65) implies the convergence 
of the series on its left at each fixed point xz of [—x, a]. Since that 
series consists of nonnegative terms, to prove its uniform convergence 
on [—zx, 2] it suffices by the Dini theorem*™* to establish that both 
each of the functions a, (x) and b, (x) and the sum of the series (10.65) 


st 


= (f {x + t) g® (t) di are continuous functions of z on [—nx, a], 
~s 
but this follows at once from the preceding corollary (it suffices to 
consider that the square of a piecewise continuous function is a piece- 
wise continuous function and that cos nt and sin nt, with every n 
fixed, are continuous functions). 
Corollary 4. If each of the functions f (t) and g (t) is piecewise con- 
tinuous on [—x, 1] and periodically (with period 2x) extended to the 


* See Corollary 4 in Section 10.3.3. 
** See Theorem 1.5 (statement in terms of series). 
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whole infinite straight line, then the sequence 


cy (2) ==> \ f(z-+t)e(tsin(n +) tat (10.66) 


converges to zero uniformly with respect to x on {—a, a\ (and therefore 
on the whole infinite straight line). 
Proof. It suffices to consider that 


. 4 . ft . t 
Sin (n+5)1=cos nt-Sin => -{- Sin nl + COS <> 


and apply Corollary 3, taking the function g (/)-sin 5 instead of 


g(t) im (10.63) and the function g (1): cos instead of g(t) in 


2 
(10.64). 

10.5.4. The principle of localization. We shall prove here that 
the question of whether the Fourier trigonometric series of a func- 
tion f (xz) piecewise continuous on [—x, a] and periodical (with 
period 21) converges or diverges at a given point zy can be solved 
only on the basis of the behaviour of the function f (x) in arbitrarily 
small netghbourhood of x). This remarkable property of a Fourier 
trigonometric series is generally caJled the principle of localization. 

We begin by proving an important Jemma. 

Lemma 3 (Riemann). If a function f (x) is piecewise continuous 
on [—x, 2] and periodically (with period 22) extended to the 
whole infinite straight line and if it vanishes on some interval 


fa, b]*, then for any positive number 6 smaller than “5 the 


Fourier trigonometric series of f(x) converges to zero uniformly 
on the interval {a-+6, b—8]. 
Proof. Let 6 be an arbitrary positive number smaller than 
b—a 
5: 
f(x) at an arbitrary point z of an infinite straight Jine is defined 
by equation (10.55). Setting 


The partial sum of the Fourier trigonometric series of 


when 6<|¢t|<a, 
g(t)=y 25ing (10.67) 
L 0 when [i{<§ 


and considering that f (x + 1) is zero provided z is in [a + 6, b — 4] 
and ? is in the interval | ¢ | < 6** we may rewrite (10.55) for each 


* The closed interval {a, b] is quite arbitrary. In particular, it may be con- 
tained entirely in J—a, =<}. 
** By virtue of the fact that f (z) is zero throughout [a, 8]. 
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z of [a + 6, b — 6] as 


Sn (a, == | f(ett)g(t)sin(n+z) tat. 


It remains to take into consideration that by Corollary 4 of Section 
10.3.3 the sequence on the right of the last equation converges 10 
zero uniformly with respect to x on the whole infinite straight line. 
Thus the lemma is proved. 

Immediate consequences of Lemma 3 are the following two 
theorems. 

Theorem 10.13. Let a function f(x) be piecewise continuous 
on a closed interval {~—a, x] and periodically (with period 2n) 
extended to the whole infinite straight line and let {a, b] be some 
closed interval. For the Fourier trigonometric series of f (x) to 


converge, with any positive 6 smaller than = , (to f(x)) uni- 


formly on the interval fa+56, b—§&) it is sufficient that there is 
a periodic (with period 2x) function g(x) piecewise continuous orn 
[—x, m1] that has a Fourier trigonometric series uniformly con- 
verging on {a, b] and coincides on [a, b] with the function f(z). 

Proof. Applying Lemma 3 to the difference [f (x)—g(x)] we 
see that the Fourier trigonometric series of the difference [f (z)— 
g(zx)] converges to zero uniformly on [a-+6, 6—86] for any 6 in 
the interval 0<b< 8 , and from this and from the uniform 
convergence on [a, b}] of the Fourier trigonometricseries of g (z) 
we obtain the uniform convergence on [a+ 6, b—6] of the Fou- 
rier trigonometric series of f(z). The fact that it is to f(x) that 
the last series converges on {a+6, b—8] follows immediately 
from Corollary 5 of Section 10.3.3. Thus the theorem is proved. 

Theorem 10.14. Let a function f(x) be piecewise continuous 
on a closed interval [—xn, x] and periodically (with period 2x) 
extended to the whole infinite straight line and let x, be some point 
of the line. For the Fourier trigonometric series of f (x) to converge at 
Ly it is sufficient that there is a periodic (with period 2x) function 
g (x) piecewise continuous on [—x, x) that has a Fourier trigonometric 
series converging at the point x, and coincides with f (x) in an 
arbitrarily small §-neighbourhood of 2p. 

Proof. Jt suffices to apply Lemma 3 to the difference [(f («)— 


m+ | and consider that the 


—g(zx)} on the interval | m—+, 
convergence at 2%) of the trigonometric series of the functions 
[f (z) — g (x)] and g(z) implies the convergence at that point of 
the Fourier trigonometric series of f(z) as well. Thus the theo- 


rem is proved. ” 
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Theorem 10.14 does not establish any particular sort of conditions 
that ensure the convergence of the Fourier trigonometric series of 
j (z) at a point zp». It only proves that those condilions are deter- 
mined solely by the behaviour of f(z) in an arbitrarily small 
neighbourhood of x, (i.e. have a local character). 

10.5.5. Uniform convergence of a Fourier trigonometric Series 
for functions of a Holder class. In this and the next subsection we 
shall discuss more precise conditions ensuring the uniform conver- 
gence and convergence ofa Fourier trigonometric series at a given point. 

We prove the following main theorem. 

Theorem 10.15. If a function f (x) belongs on [—a, =a] to a Hilder 
iclass C® with an arbitrary positive inder a (QO <a <1) and if in ad- 
dition { (—a) = f (x), then the Fourier trigonometric series af f (zx) 
converges (to that function) uniformly on {[~a, a). 

Proof. We shall assume as usual that f (x) is extended periodically 
(with period 22) to the whole infinite straight line. The condition 
f(—a) =f (a) ensures that the function thus extended belongs 
‘to a Holder class C2 on the whole infinite straight line. 

Let x be any point of [—az, a]. Multiplying both sides of (10.56) 
by { (rv) and subtracting the resulting equation from (10.55) we get 


‘ 33 sin ( 5) 
Sy (z. /)—f(z)= = [ ae) (z))— 7 dt. (10.68) 
—7 sin > 


‘The condition that f (x) belongs to a Hélder class C* implies that 
there is |J such that 
[f(z +t) —f(z) |< M-te (10.69) 

in any case for every x and every ¢ in [—a, al. 

Fix an arbitrary e > 0 and choose for it 6 > 0 satisfying 

MM a E ~ 

=—-5 <j Z- (10.70) 
Dividing [—x, x] into a sum of the closed interval |¢ |< 6 and 
the set 6 < {t | <a, we make equation (10.68) take the form 


. 1 
Ssin{n—-—~— fl 
Sn (a, f)—f(2)=— | fees n— pay OE) ang 


pt {<6 2 sin 


n(n t 
fa f se(enz)t ay, (10.71) 
7 
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To evaluate the first of the integrals on the right of (10.71) use 


inequality (10.69) and consider that <> for every 
2 in| ” 


fin [_—a, a]*. 
We see that for any 7 and any rin [—2x, =] 


1 
[f (z+ t)—f (z)] vn (n z)! dys 

1t1<6 2 sin > 

1 

< | if@+o—f(@| soln: 2) dt 
(t}<s 2| sin x | 

<3 bi arm ata fs @—F t= a 6°. 

1ti<6 0 


From this in view of (10.70), for any m and any z in [—a, =] 


+ \ [f (e+ t)—f (2)]} 4 aj <<. (10.72) 
° <6 2sin— ° 

= 2 

Using the function (10.67) piecewise continuous on the second of 
the integrals on the right of (10.71) can be written as 


| 


= \ f(z@+1) g(t) sin (n+ 5) t. 


= at 


By Corollary 4 of Section 10.5.3 the right-hand side of the last equa- 
tion converges to zero (as m —+ oo) uniformly with respect to z on 


* This inequality follows immediately from the fact that the function 
sia7 


decreases from 1 to 2/m as x changes from 0 to 7/2. The fact that 


sinz sinz\"’ coss 
—= decreases follows In turn from the fact that ==} =—5 


X (r—tan z) <= 0 everywhere for 0< 2 < 2/2, since z<tanz for 0<cz< 21/2 
(see Section 4.5.6 of {1}). 
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{—x, x]. For the fixed « > 0 therefore there is V, such that 


j ¥ ' 2 e _ 
7 | f(x -b 1) ———— <3 (10.73 
S<pt (<x 2sin > 


for every n >> A, and every x in [—a, al. 

To evaluate the last integral on the right of (10.71) notice that 
using the piccewise continuous function (10.67) this integra] can be 
written as 


{ 
sin (n+ t . “T 
Lc) etre) yi | eysin(x+y)tde. 
é<fti<a 9 2 5IN on : 


The integral on the right of the last equation converges to zero (ns 
n ~—» oo) also by Corollary 4 of Section 10.5.3 (it suffices to apply 
that corollary to the function f(z) = 1). Considering also that 
f (x) is in any case bounded on [—az, a] we see that for the fixed arbi- 
trary ©€ > 0 there is NV. such that 


dii<z (10.74) 


for every n> NV. and every x in [—a, a). 

On denoting by .V the largest of the two integers Vy, and NV. we 
find by virtue of (10.71) to (10.74) that for the fixed arbitrary ¢ > 0 
there is N such that 


1S, (@, f)—fzi<e 


for every n > WN and for every x in [—a, aj. Thus the theorem is 
proved. 

Remark 1. [tis obvious that under the hypotheses of Theorem 10.15 
the Fourier trigonometric series converges uniformly not only on 
[—n, x}, but also uniformly on the whole infinite straight line (to \he 
function which is a periodic (with period 2n) extension of f (x) to the 
whole infinite straight line). 

Remark 2. Note that in evaluating the integrals (10.73) and (10.74) 
we used only the piecewise continuity (and the resulting bounded- 
ness) of the function f (7) on [—a, a] (no use was made of the fuct 
that f (7) belongs to a [ojder class). 


Remark 3. The question naturally arises as to whether it is possible to slack- 
enin Theorem 10.15 the smoothness requirement on 7 (7) while preserving the 
statement about the uniform convergence on [—2, a] of the Fourier trigonometric 
series of f (z). 
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Recall that the fact that f (x) belongs on [—x, a] to a Holder class C® means 
by definition that the modulus of continuity of / (x) on [—a, x] hag an order 


o (5, f) = 0 (8%). 


We note without proof the so-called Dini-Lipschitz theorem which states that for 
the Fourier trigonometric Series of { (zr) to converge uniformly on [—a, nm] it is 
sufficient that f{ (z) satisfies f (—a) = f (x) and that its modulus of continuity on 
{—n, 2) has an order 


le. iS an infinitesimal, as 6 > 0, of an order higher than = 


The Dini-Lipschitz theorem contains a final condition (in terms of the modulus 
of continuity of a function) for uniform convergence of the Fourier trigonometric 
Series of the function, for it is possible to construct a function f (z) satisfying 


{(—7) = f (a) with a modulus of continuity of order O (a) on (—a, a] 


and with a Fourier trigonometric series diverging on a set of points everywhere 
dense on [—a, aJ*. 


Under the hypotheses of Theorem 10.15, after a periodic (with 
period 2m) extension the function f (z) was found to belong to a Hél- 
der class C% on the whole infinite straight line. The question naturally 
arises as to the behaviour of the Fourier trigonometric series of a 
function f (z) belonging to a Hélder class C* only on some closed 
interval (a, b) and Satisfying but the ordinary piecewise-continuity 
requirement everywhere outside it. 

The answer to this question is given by the following theorem. 

Theorem 10.16. Let a function f (x) be piecewise continuous on(—zx, 21] 


line. Suppose further that on some closed interval [a, b) of length smaller 
than 2x it belongs to a Hélder class C® with an arbitrary positive index 


a(0<a<1). Then for any & in the interval 0 < § < the 


Fourier}trigonometric series of f (x) converges (to that function) uni- 
formly on the closed interval {a + 6, b — 6), 

Proof. Construct a function g (x) that coincides with f (z) on [a, 0), 
is a linear function of the form Az + Bon (b, a + 2x] turning into 
f((b)iwhen z = b and into f (a) when z = a + 2n** and that is perio- 

ically (with period 2m) extended from a closed interval la, a + Qn] 
to the whole infinite straight line (in Fig. 10.1 the heavy line re- 


ese, 


* A proof of the Dini-Lipschitzt' orem and a construction of the example 
just Mentioned can iS found in the book: A. Zygmund, Trigonometric Series, 
59, 


** The condition that the function Az+B turns into /(b) when z=» 
and ip f(a) when (ot 2m) f gel defines the constants A and B: 
A —~1(b) (a+ 2x —9F (a 

’ a--in—p? B= a+2n—b 
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presents the graph of f (c) and the broken line is the graph of a fune- 
tion g (x) constructed from f (z)). 

It is obvious that the constructed function g (x) satisfies the con- 
dition g (—a) = g(a) and belongs to a Idlder class C% (having 
the same positive index @ as f (r)) on the whole infinite straight 
line*, By Theorem 10.45 and Remark 1 the Fourier trigonometric 
series of g (x) converges uniformly on the whole infinite straight 
line and therefore by Theorem 10.13 the Fourier trigonometric series 


y | 


Fig. 10.1 


of / (x) converges (to that function) uniformly on [a + 6, b — 4] 
. . b— rn . 

for any 4 in the interval 0 < 6 <—=. Thus the theorem is proved. 

Remark 4. The statement of Theorem 10.16 remains true for 

a closed interval fa, b] of Jength equal to 27 (i.c. forthe case 

b=a-+-2n), but then in proving the theorem, on fixing an arbi- 

trary O in the interval O<S6<a, we should take the function 


g(x) to coincide with f(z) on E -i S at2n—s |, to be Jinear 


9) 8) . ye . . 
on | @ 2i— 5, a+ Ine 5 and periodically (with period 2:7) 


extended from | « + > a4-24 + 3] to the whole infinite straight 
line. But if fa, b] is of length greater than 22, then from { (2) 


belonging to a Hélder class C* on such an interval and: from 
the periodicity condition of f(z) (with period 22) it follows that 
f(z) belongs to a class C* on the whole infinite straight line, 
i.e. in this case we arrive at Theorem 10.45. 

10.5.6. On the convergence of the Fourier trigonometric series of 
a piecewise Holder function. 

Definition J. Weshallsay that a function f(x) is piecewise Hélder 
on a closed interval la, b] if it is piecewise continuous on a, b) and if 
la, b] can be divided by means of a finite number of points a = 2 < 


* It suffices to consider that ¢ (r) is everywhere continuous and that a 
linear function has a bounded derivative and belongs therefore to a Holder 


class C* for any @ < 1. 
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Sty <Ctase.. Sa, = Db into subintervals |x, o,) (hk = 4, 
2,..6, Rn) on each of which that function belongs to a Hélder 


class C™ with same positive under a, (A< a, <1); when defining 
the Hilder class on a subinterval la; _,. a;) the amiling values f (ay, + 
+ Q) and f (ry — 0) should be taken as the values of the function at 
the end points of fay iy. ry )*. 

In other words, the domain of any piecewise Holder function falls 
into a finite number of closed intervals having no interior points 
in common on each of which the function belongs to a Hélder class 
with some positive index. 

Each of those intervals will be called a@ smoothness section of the 
function. 

Definition 2. We shall sau that a function f(x) is piecewise smooth 
ona closed interval la, bl if it ts piecewise continuous on fa, bd) and 
has a piecewise continuous derivative on la, b\**, ie. if F(z) is piece- 
wise continuous on la, bland its derivative { (x) exists and is continuous 
everinthere an la. b) ercept possibly for a finite number of points at 
each of which the function f' (a) has finite right- and left-hand limiting 
values, 

Itis clear that any function prevewise smooth on a closed interval 
lz, b) is piecewise Holder on it. 

The JolJowing main theorem holds. 

Theorem L047. Leta function f (2) piecewise Hélder on an interval 
[—a, a] be periodically (vith period 22) ealended to the whole infinite 
stratght line. Then the Fourier trigonometric series of f (xr) converges 
ateach point © of the intinile line ta the value f (rc) == 1.2 [7 (7 — 0) + 
—fi(r-- QO), the convergence of that serics being uniform on cvery 
fixed closed interval inside a smoothness section of 7 (x). 

Proof. The statement of the theorem about the uniform conver- 
gence on every fixed closed interval inside a smoothness section 
follows immediately from Theorem 10.16. Also following from this 
theorem is the convergence of the Fourier trigonometric series of 
f(x) at each interior point of a smoothness section of f (z)7*". Tt re- 
mains to prove the convergence of the Fourier triganometric series 
of f (xz) at each point of junction of two smoothness sections. 

Fix one of such points and denote it by a. Then there are constants 
A, and AL. such that for any sufficiently sma)) positive z 


Lie) —f Ft O(S Mt O<ay <1), (10.75) 


* A piecewise Holder function, as any piecewise continuous function, must 
have values equal at every point 2; to a half-sum of the right- and left-hand 
limiting values at that point, ic. the equation f (1,) = t/2 [7 (x), — 0) + 
+. f(z, -+ 0)) must hold. 

** See Definition 1 in Section 10.4.2. 
#3* For onch interior point of a smoothness section may be taken as a neigh- 
bourhood of a closed interval inside that section. 
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and for any sufficiently small negative ¢ 
[f(rtiy—fi(r—O){RM- [tf (O<a,.<1). (10.76) 
Denote by A/ the largest of the numbers Jf, and AJ. and by @ the 
smallest of the numbers a, and a.. Then, with | ¢ | < 41, on the right 
of each of the inequalities (10.75) and (10.76) we may wrile Al. | t [%. 
Now fix an arbitrary e > 0 and choose for it 5 > 0 satisfying 
(10.70) and so small that both inequalities (10.75) and (10.76) hold 
for [¢ |< 6 and we may take a number AZ- | t [* on the right-hand 
sides of those inequalities. Repeating the reasoning of the proof for 
Theorem 10.45 we arrive at equation (10.71) and to prove the theorem 
it remains for us to show that the estimates (10.72), (10.73), and 
{10.74) hold at the fixed point z. In Remark 2 of Section 10.5.5 we 
have noted that the estimates (10.73) and (10.74) hold for any fune- 
fion only piecewise continuous and periodical (with period 2n). It re- 
mains to prove the validity for all x of the estimate (10.72). 
Having in mind that f(z) = 1/2 [f (z—0)+f# (z4-0)) and that* 


6 sin (nts) na2{ sin (n+) ¢ lin? sin(nt+3)¢ 
t . . 


; dt 
—6 2sin > 0 2sin-=> ae, 2 sin 7 
we may rewrite the integral on the left of (10.72) as follows: 
. 1 
4 sin are 
~ { tf(ete—fiay a ne? 
1 1<6 sin + 
{ 6 L sin (n+<) t 
=— | [f(z+t)—f(2+0)] —— dt + 
0 2sin > 
' 0 sin (n4 =) t 
4 | ff (2-+0)—f (2 —0)] —-——— at. (10.7) 
~ s 2sin — 


sin ( 5) 
In| rs 


2 sin > 


* In view of the fact that the function @ (t)= ig even, 


i.e. satisfies p(—t)= @ (‘) for any #. It is casy to see that for such a function 
ra) 0 
@ (t) dt= p(t) di (it suffices to make a substitution t= —t in one of 
0 a 


~§ 
the integrals) and therefore 
6 


(t)di=2 { g (t)dt=2 @ (t) dt. 
-6 0 -6 
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To evaluate the integrals on the right of (10.77) use inequali- 
ties (10.75) and (40.76), writing on the right-hand sides of the 


inequolitios a aumber M\t|*. Taking into account the estimate 
<7 (with [¢{<<m), already applied in proving 
2) 51D = ; 

Theorem 10.5, and inequality (10.70) we have 


{oe sin(n+s)t 
a \ Uf(e+i—fF(2)]-——>— 
it [<6 2sin> 


ays 


6) 0 
<3() mart | be jor! dt] az 8 <5. 


Thus the estimate (10.72) and hence the theorem are proved. 

Corollary 1. The statement of Theorem 10.17 will clearly be true 
if we take a function piecewise smooth instead of piecewise Hilder 
(on {—x, x]), periodically (with period 2x) extended to the whole in- 
finite straight line. 

We introduce a new concept to formulate one more corollary. 
Let O<a<ci. 

Definition 3. We shall say that a function f (x) satisfies at a given 
point z on the right (left) a Holder condition of order « if f (x) has 
at the point x a right-hand (left-hand) limiting value and if there is 
aconstant AT such that for all sufficiently small positive (negative) t 


| f(z toh—f (z+09) | | f (x-+-t)—f (ec —9) | 
SM (ee). 


It is obvious that if f(z) has at a given point z a right- 
hand (left-hand) derivative understood to be the limit 


lim Let0=1e—0 ), 


f (z4+-t)—f (x-+0) ( 
t t--0—-0 


lim 

{+0+0 
then f(x) a fortiori satisfies at that point z on the right (loft) a 
Holder condition of any order a<ci, 

Corollary 2 (condition for convergence of a Fourier trigonomet- 
ric series at a given point). For the Fourier trigonometric series of a 
piecewise continuous and periodic function f (x) (with period 2m) to con- 
verge at a given point z of an infinite straight line it is sufficient that 
f (x) satisfies a Hélder condition of some positive order a, at the point x 
on the right and a IfGlder condition of some positive order ad, at the 
point z on the left (and it is clearly sufficient that f (zx) has a right- 
hand and a left-hand derivative at the point x). 


22~-01684 
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Proof. It suffices to notice that from the fact that f (x) satisfies 
a Holder condition of order a, (order a.) at a point z on the right 
(left) it follows that there is a constant JJ, (a constant AJ.) such that 
inequality (10.75) (inequality (10.76)) holds for all sufficiently small 
positive (negative) ¢. But our proof of Theorem 10.17 uses only in- 
equalities (10.75) and (10.76) and the piecewise continuity and perio- 
dicity of f (z). 

Example. Without computing the Fourier coefficients of the func- 
tion 

cosx when —xx<iz<0, 
f(z)= 2 1/2 when z=0, 


Vaz when O<zca, 


we may state that the Fourier trigonometric series of that function 
converges at a point z = 0 toa value of 1/2, for f (x) has at that point 
a left-hand derivative and satisfies at it on the right a Holder condi- 
tion of order a, = 1/2. 


10.5.7, The summability of the Fourler trigonometric serics of a continuous 
function by arithmetic means. We have already noted that the Fourier trigono- 
metric series of an everywhere continuous and periodic function (with period 27) 
may be divergent (sce Section 10.5.4). We prove that that series is nevertheless 
always summable (uniformly on the whole infinite line) by the Cesaro method 
(or the method of arithmetic means)*. 

Theorem 10.18 (Fejér**). Jf a function { (x) {s continuous on an Interval 
[—x, n] end satisfies the condition { (—a1) = f (x), then the arlithmette means of 
the partlal sums of tts Fourter trigonometric sertes 


So(z, +S, (z, fy-+... +Sn-y (2, f) 


n 


converge (to that function) untformly on [~—a, x] (,and provided a function with 
period 2n is extended to the whole Infinite line, they converge untformly on the whole 
infinite Ine). 

Proof. From equation (10.55), for S, (z, f) we get 


Sn(z, N= 


ow 


n= { 
One, oe lee | sin (x+5) | dt. (10,78) 
She 


To compute the sum in the brackets of (10.78), sum the identity 

2 sin as sin (145) t=cos kt—cos (k-+-1) ¢ 

2 2 
overall k= 0,4,.-.,n ~—1. Asa result we get 
jay { t 
tn ot aon ee = g nt 

2 sin -> »Y sin ( h x) t= cos ni=2 sin ae 

he= 


* See Supplement 2 to Chapter 13 of [4]. 
** 7. Fejér (1880-1959) is a Hungarian mathematician. He proved his 


theorem in 1904. 
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Using the last equation reduces (10.78) to the form 
rut 


‘ mt sin? ~- 
on (et, f)=— | (@-+9-——. dt. (40.79) 
~x 2 sin? = 


From (40.79) it immediately follows in turn that 


M ein’ ne 
n §1n > 
— | ———*_di=1, (40.80) 
nn t 

1 2 sin* = 


for the left-hand side of (40.80) is equal to the aritlinetic mean of the partial 
sums of the Fourier trigonometric series of the function f (x) c=: 1 and all the 
partial sums are identically equal to unity (see Section 10.5.2). 

Fix an arbitrary e >> 0. According to Theorem 10.9 (Weierstrass) there is a 
trigonometric polynomial 7 (z) such that 


tf (x) — T (x) | < 2/2 (10.81) 


for all x of the infinite line. By the linearity of arithmetic means a, (z, f) = 
=o, (x, f — T) +6, (z, T), so that 


lon (2, —TWI<lonesf—Ml+lon i T)—Tl)} 0.82) 
On writing equation (40.79) for the function [f (r) — T (x)} we get, taking into 


. a Tt 
sin ~~~ 
account the nonnegativeness of the function 2. called the Fejér kernel 
2 sin® > 
and using the estimate (10.81) and equation (10.80), 
1 f sin? 
lon(z, 1—-TI<S | 1 e+9-T e+) 1—pa< 
t 
T gin2 Ride 
<et. 2 ty a (10.83) 
2 stn sag t 2 
~ IT 2 sin oY 


Inequality (10.83) holds for any n. Now notice that the Fourier trigonometric 
series of a polynomial 7 (r) coincides with that polynomial. It follows that all 


partial sums S,, (x, T) beginning with some integer ng are equal to 7 (r). But 
this allows us to find for the arbitrary e > 0 fixed above an integer N such that 


{o, (xz, T) — T (2) | < 2/2 (10.84) 
for all n> WN and all z. 

From inequalities (10.82), (10.88), and (10.84) we conclude that | o, (z, f) — 
— { (z) | < e for all n > WN and all z. Thus the theorem is proved. 

10.5.8. Coneluding remarks. 1°. In solving a number of specific 
problems one has to expand the function into a Fourier trigonometric 
series not on the interval [—x, x} but on [—J, i] where / is an ar- 
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hitrary positive number. To go over to such a case it suffices to re- 
, iu . 
place the variable x by +x throughout the above reasoning. Of 


course, all the established results will remain valid under such 4 
linear change of variablo. They will apply to the Fourier trigonomet- 
ric series 


= -f- »y (a, cos — ka -{- by sin = iz | (10.85) 
h->4 


with the following expressions for the Fourier coofficients 


l 
1 
oom az | (ya, 


-! 

t + (10.86) 
at | ficostirdt, m= 4 { fe)sinZaea | 

-) ~I 
(hz 4, 2) ...). } 


Wo shall not reformulate any of the established theorems but merely 
note that in all the statemonts the interval [—z, 2] should be re- 
placed by [—-/, and tho period 27 by a poriod 2). 

2°. Recall that a function f (xz) is said to be even if it satisfies the 
condition { (~2) = f(x) and odd if it satisfies f (—x) = —f (2). 

From the form (10.86) of Fourier trigonometric coefficients it fol- 
lows that for an even function f (z) all the coofficionts b, (k = 4, 
2, ...) are equal to zero and for an odd function f (x) all the cocf- 
ficients a, (*# = 0, 4, 2, ...) are oqual to zoro. Thus an even fune- 
tion f (v) cau be expanded into a Fourier trigonometric series only with 
respect to cosines 


iy) 
st 
P+ » ay, COS =F hx 
her 
and an odd function f (x) can be expanded into a Fourier trigonometrie 
series only with respect to sines 
a 
. xt 
SY by sin a ker, 
herr} 
3° We cive a vory froqnuently used complex notation for the Fourier 
trigonometric series (10.85), 
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Using the relations* 


. x 
—{— hk: $ . 5 {— hk: 3 «e 

ar "= cos + ka — isin ke, e ! “= cos he-+isin + he, 
it is easy to see that the Fourier trigonometric series (10.85) with 
the Fourier coefficients (10.86) reduces to the form 


ee ~ict hx 
So get (10.87) 


where the complex coefficients c, are of the form 
d pr 
ia ht 
—s | f(tje t adi (10.88) 
-1 
and can be expressed in terms of the coefficients (10.86) by the for- 
mulas 
a 2n—ib an ibn, . 
C=: Cy =a (k=l, 2, weeds 
4°. Especially important for applications is the problem of eval- 
uating a function from approximate Fourier coefficients of that 


function. A solution of this problem using the so-called regulari- 
sation method is given in the Appendix. 


10.6. THE FOURIER INTEGRAL 


When a function f (x) is given on the whole infinite line and is not 
periodic whatever the finite period, it is natural to expand it not 
into a Fourier trigonometric series, but into the so-called /ourier 
integral. 

The present section is devoted to the study of such an expansion. 

Throughout it we make the function f (x) obey the requirement 
of absolute integrability on an infinite line (—0oo, 00), i.e. we re- 
quire that there should be an improper integral 


oo 


| f(z) | dz. (40.89) 


6 
ow OO 


Let us agree to use the following terminology. 

Definition. We shall say that a function f (x) belongs on an infinite 
line (—oo, oo) to a class L, and write f (x) € Ly (—oo, 00) tf f (x) 
is integrable (in the proper Riemann sense) on any closed interval and 
if the improper integral (10.89) converges. 


* These relations are immediate consequences of the Euler formula estab- 
lished in Section 1.5.3 
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10.6.1. The Fourier transform and its simplest properties. 

Lemma 4. If f (x) € L, (—oo, oo), then for any point y of an in- 
finite line —w < y <tc there is an improper integral* 

fly)= \ ef (2) dz (10.90) 


called the Fourier transform (or image) of the function f (x). Moreover, 


the function f (y) is continuous in y at each point of the infinite line 
and tends to zero as |y | ~o, i.e. 


Jim tf(y) |=0. (40.94) 
y Joc 
Proof. It follows from the equation | e” f(z) |] = | f(z) |, from 


the convergence of the integral (10.89) and from the Weierstrass 
test (see Theorem 9.7) that the integral (10.90) converges uniformly 
in y on each closed interval of the infinite line, and hence, by virtue 
of the continuity of the function e'* in y, it follows from Theorem 9.9 
that the integral (10.90) is continuous in y (on each closed interval, 
i.e. at each point of the infinite line). 

It remains to prove the relation (10.91). Fix an arbitrary © > 0. 
In view of the convergence of the integral (10.89) we may fix A > 0 
such that 


-A _ 
\ | f(z) | dx \ | f(z) | dr<z. (10.92) 
one A 


With A thus fixed (by virtue of (10.92)) we have 


oc A 
| | ef (2) dz|<| | eff (x) dx|+ = (10.93) 
a ~A 
and for the relation (10.91) to be proved it remains for us to 
establish that the integral on the right of (10.93) is less than 


Ze for all sufficiently large | y |. 


Since f (x) is integrable on [—A, A], we can fix a subdivision T 
of [—A, A} such that for the upper sum S7 of the subdivision*® 
A 


0<S>— \ f(z)dz<t, (10.94) 


-A 


* The complex function f (vy) = u (y) + fv (y) of a real-valued independent 
variable y is considered as a pair of real-valued functions u (y) and pr (y). The 
continuity of j (v) at a given point y is understood to be the continuity at that 
point of each of the functions u (vy) and v (u). 

**® See Sections 10.2 and 10.3 of [4]. 
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Suppose that the subdivision T is made using the points —A = 
= 290%, Sig, ...< 2, =A and that M, is the supremum 
of f (x) on a subinterval [z,_,, z,] (k = 1, 2, ..., 2). 

We introduce the function 

7. (2) =| M, when z,<r7<z, (k=1, 2, ..., 7), 

710 when z=a, (k=0, 4, 2, ..., 7). 
Since an integral is independent of the value of the integrand in a 
finite number of points, it is obvious that 

A 


| Fr (2) da= D) My (t4— 24-1) = Se, 


-A k=t 
so by (10.94) 
A A 


| tir (@)—F(@) (de= | fe (e)—fide<F. (10.95) 


-A -A 


Relying on inequality (40.95) and considering that | e**” | =1 and 
x 


that | j eV dc |<2/ | y | we have 


“hor 


| Fen (x) da| =| F 017 (2) Fr (x) -+ fp (x)] da|< 
“A “4 


<| f eV (2) dx| +| i oY [Fx (a) —f (x) dz|-< 
~A ~A 


nr ed 27 A 

<DIMal-| | evdelt | lie (@)—F@) lde< 
k=1 py -A 
2 < 2 

Sq DI Mal t+g<ze 


nr 
provided |y >t] > | M, |. Thus the lemma is proved. 
hot 
Corollary. If f (x) € L, (—oo, ©), then 


lim J cos Az-f (x) dr=0, iim \ sin Az-f (x) dxz=0. 


oo —Oo 
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10.6.2. Conditions for the expansion of a function into a Fouricr 
integral, 

Definition. For each function f (x) of the class L, (—0o, c) the 
limit 


A , ow 
(provided it exists) is called (he expansion of the function into a Fourier 
integral. 

We prove the following main theorem. 

Theorem 10.19 (condition for the expansion of a function at a 
given point intoa Fourier integral). If f (x) € L, (— oo, co) and if f (x) 
satisfies a Hdlder condition of some positive order a (0 <a, <1) 
ata given point x on the right and a Holder condition of some positive 
order Go (0 <1 a. < 41) on the left, then at that point x 

a 


lim se [ ef (y) dy= LETOEIE 9 | (10.96) 

Aco oT oo 2 

Remark 1. At every point z at which the value of f (x) is equal to 
a half-sum of the right- and left-hand limiting values (in particular, 
at each point of continuity of f (z)) we may write f (z) on the right 
of (10.96). 

Proof of Theorem 10.19. Since the Fourier transform ¢ (y) (in 
view of Lemma 4) is a continuous function of y, for any positive 2 
there is an integral 

} 


. ) co 
~ixyF (1) dy a= \ e7i* iu lu \ dy. 10.97 
\e UF (y) dy je [J vf (u) du | dy (10.97) 


= Jy 


In the integral on the right of (40.97) we may change the order of 
integration with respect to y and u (since the inner integral con- 
verges uniformly with respect to y on any interval [—2., 4)). 

Change the order of integration with respect to y and u, uso 
the equation e’"'—) — cosy (u—z) + isin y (u—z), 


a a 
cos y (u— Zz) dy = SE { sin y (u—z) dy =0 
aly 3 me 


and make the substitution u= 2 + ¢ to pet 
2 


co 2. 
se Sei Wdy=ge | [ (ees dy | f(y) du= 
: ~c =) 


¢ sink(u—z | r si }, 
a | ES duet | SY etna. 
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So for any positive 4 
rN 


1 ~ixyf dt = 
pe \ ei (uy) ay 
1 f sink 1 sind 
sin 
= | SS tetna+— |S SIN § 4 4) at. (10.98) 
co 0 

Now note that for any positive % we have the equation* 
co 0 

sin At _ a sin _ a 
dt=—- and, therefore, —— t#==. 


It follows from the last two equations that for any positive A 


f(zt+ oe Wie 4.0) Sim at sini dt, (10.99) 
0 
0 

f 9 =+ J f(x—0) SEM at. (10.100) 


Subtracting from. (10.98) equations (10.99) and (10.100) we see that 
for any positive A 


4 
Je ~ixy# (y) d y—Lerdtie _ 


oo 


== | IF (2-4-1) —f (x-+0)) Seat see dt -- 


° 


at 


$5 j (fF (x-+t)—f (e—0)] S2™ ae. (40.101) 


Since f (z) satisfies a Holder condition of order a, on the right 
and a Hélder condition’ a. on the left, there are constants AJ, and 
M, such that inequality (40.75) holds for all sufficiently small posi- 
tive t and inequality (40.76) does for all sufficiently small negative ¢. 
li we denote by Jf the largest of the numbers MJ, and Mf, and by a 
the smallest of the numbers a, and a. then we may write Af | t |¢ 
on the right-hand sides! of (10.75) and (410.76), these inequalities 
being true for all positive (respectively, negative) values of t satis- 
fying |t |< 5, where 4 is an arbitrary, sufficiently small positive 
number. 


* See Section 9.3. 
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Now we may rewrite relation (10.101) as follows: 


| e7l*VF (y) dy— Lie Oe! (z— 0) — 


HA 
o 
=4 [ifteto—fe+0 Hat 
0 
0 
+= if @+y—f(e—o) SX are 
-6 
+t \ f(x -+-t) Soe sag 
thes 
co -6 
> ' —0 in 2% 
_ [e+ j sine a — Le=9 j SM ay, (10.102) 


6 — 00 


Fix an arbitrary e > O and choose for it 6 > 0 so small that 
a 
Mo" =e (10.403) 


12 4° 


Evaluating the first two integrals on the right of (10.102) with the 
aid of inequalities (10.75) and (410.76) (with Af | t |* on the right- 


dhand sides of them) we have 
6 


|= | (f(@+t)—f(2+0 *atl< 


6 

<2 fife+o-fe+ni(t<4 feta 
0 0 

and quite similarly 
0 
|= \ie+-1e-0) SA at|< 
< ( M ( M5* 
<> J ie+0-1e-01 <3 (feist ae= 
-6 


from the last two inequalities and from (10.103) we get 


|= Uf (zt) —f (2+ 0)] 2 at| + 


0 
+|= ( Wero—se- 0 EE at|<s. 


(10.104) 
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To evaluate the third integral on the right of (10.102) we intro- 
duce the oti 


4 — ler when {¢ [26 
g(t) = “3 
0 when |t|<6d. 
Since g (t) € L, (—co, ©), by the corollary of Lemma 4 


0 


. . . 4 i 
lim \ g (i) sin Mt dt = lim > \ fict1) 2™ a =0, 


jt jad 
but this means that for the fixed arbitrary e > 0 there is A, such that 
= \ fier) Sh a | <= (with ADA). (10.105) 
{1{26 


Finally note that 


sex dt = | me at= | Sn dr 0 
—00 6 AS 


as 4. > oo. It follows that for the fixed arbitrary e > 0 and the 
given point xz there is A, such that 


ieee | seh a 4] Leao | Sale 
6 ar) 


(with A> A,). (10.106) 


Denote by A the largest of the numbers A, and A,. From relations 
(40.102) and (410.404) to (10.106) we concluded that 
N 
| [ Uf (y) dy —LETOEIE9) | <e (with ADA). 
my 


Thus the theorem is proved. 
Corollary. Equation (10.96) is clearly true if f (z) EL, (— 0, o) 
and if f{x) has at a given point x the right- and left-hand deriv- 


atives understood to be the limits of relations lim He+)—f (e+) 


0 t-0+0 t 
( tim LEO—FE—O 


Remark 2. The limit on the left of (10.96) may be written as an 
improper integral 
a et uf (y) dy, (10.107) 


a~ & 


cannot be understood to be the limit 
A” 


Jim [ e*¥7 (yy ay 
Lay ied \° 


as h’ and }” independently tend to —co and co respective] 
Section 10.6.3 below we shall write the improper integral (10.407 
instead of the limit (10.96), keeping in mind all the time that it ig 
used in the indicated sense. 

10.6.3. The direct and the inverse Fourier transformation. Writing 
the left-hand side of (10.96) as the improper integral (10.107) ang 
considering that the value of the function f (z) at a given point z 
is equal to a half-sum of the right- and left-hand limiting values 
we obtain the equation 


f (2) = 5 ( e''Y (y) dy (10.108) 
allowing f (x) to be found from its Fourier transform f (y) and often 
called the inverse Fourier transformation. With respect to this formu- 
Ja (10.90) using which the Fourier transform f (y) can be expressed 
in terms of f (z) itself is often called the direct Fourier transformation. 

By analogy with the Fourier trigonometric series we conclude 
that the Fourier transform is the analogue of the Fourier coefficient 
and that the inverse Fourier transformation (10.108) is the analogue 
of the expansion of a function into a Fourier trigonometric series. 

Consider the direct and the inverse Fourier transformation for 
two important special cases: (1) for the case where a function f (z) 
is even (i.e. satishes f (—xz) = f (z)) and (2) for the case where f (z) 
is odd (i.e. satisfies f(—z) = —f (z)). 

(1) If f(z) is an even function, then from formula (10.90) and 
from the Euler formula e'*” = cos zy + isin zy we get 


fiy)= \ cos ryf (xz) dr=2 ) f (x) cos zy dz. (40.109) 
co 0 


From formula (10.90) it follows in turn that the Fourier transform 
} (y) is also an even function of y. The inverse Fourier transforma- 
tion therefore takes the form 
(2) = a \ f (y) cos yx dy = — \ f (y) cos yx dy. (40.410) 
—o 0 


~ 
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en ee eee 
| ine transformation 
, 10.409) is often called direct Fourier cosine 
gone 10. 110) is inverse Fourier cosine ier, eg es 
"0) If f (z) is an odd function, then quite similarly from form re 
(10.90) and (10.108) we obtain the direct Fourier sine transforma 


f(y)=2 | { (x) sin zy dz 
0 


and the inverse Fourier sine transformation 


{ (z)=— \ f (y) sin yz dy. 
0 


Of fairly frequent occurrence in practice is the case where f (2) 
is given only on a half-line 0 <2 < oo. In this case we may at 
will extend that function to an even or an odd function on the half- 
line —oo < z < 0 and use for f (x) either a Fourier cosine transfor- 
mation or a Fourier sine transformation. 

Example. Consider on a half-line 0 St<o a function f (rz) = 
= e*, where a > 0. Extending that function to an even function 
on the half-line —oo<2z<0 we obtain the direct and inverse 
Fourier cosine transformations 


f(y)=2 | e** cos zy dz = 


, ae + y? 


(i (y) is sometimes called a Fourier cosine transform), 


9 [= =} 
{ (z)=—a { aor y=e™ (250). 
0 


Extending the same function to an odd function on the half- 
eS L200, ie. setting 


e~™ when z>0, 
{(z)= 2 Q when z=0Q, 
—e-l=! when 2<0Q, 


line 


We obtain the direct and inverse Fourier sine 


transformations 
*s =. = ae 
f = 2 az .: —— 2y 
(y) Je sin zy dz = err 
eens, 


* Recall that the integral | ¢ 


Way by two-fold ; 
se i a 4 integration by 


~* cos zy dz can he Computed in an ele 


mentary 
ng footie (see Chapter 6 of (1}). 
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(f (y) is sometimes called a@ Fourier sine transform), 


2 © ysinys e~* when z>0, 
f(z)= 1 \ “aty? 2 v=4 0 O when zr=0. 


10.6.4, Some further propertics of the Fourier transformation. We shall dis- 
cuss here some further properties of Fourier transformation, frequently encoun- 
tered in applications. 

Lemma 5. For some nonnegative integer k, let a function (1 + [2 {)®-f (xc) € 
€ L, (—o09, co). Then the Fourier transform (10. 90) of the function Y (x) is differen- 
liable h times with respect to the vartable y, it being possible to compute the dertvative 
with respect to y of any order m (m = 1, 2, ..., k) by differentiation uncer the 
integral (10.90), f.e. from the formula 


am = ( etsv (ayn dr (m=4, 2) oe, k 10.444 
Bo Tw)= | tev (ieym-s(eydz (mat, 2-5 8. (40.414) 


— 0 


Proof. From the inequality 
af efx. (fx)™ f (2) | < (14+ | 2 fhe] f(z) | 


true forany m (m = 0, 1, ..., &) and from the convergence of the improper 
©oO 


| pone etYf (2) 


integral \ (1 + [x fjk-j f (x) | dx by the Weierstrass test (Theorem 9.7) it 
om OD 
follows that the inte gral on the right of (10. M1) converges uniformly in y (on 
every closed interval) for any m = 0, 4, . By virtue of Theorem 9,40 
this ensures that there is a derivative with Tespect to y of any order m = 4, 2,. 
, k and that formula (10.141) is true. Thus the lemma is proved. 
"Lemma G6. Let a function f (x) have at every potnt x all derivatives up to order 
k > 1 inclusively, the function } (2) itself and its derivative ve of order k being absolute- 
ly “integrable onan infinite line and for any m = 0, 1,..., (kh —41 


tim (Se ams oi |= 0. (10.442) 


[x Jc dx™ 
Then for the Fourter transform i ty) of f (x) as J} y | +> c© we have the estimate 
f(y) 1=0(Ly 17). (10.143.) 
Proof. Consider for any 2 > 0 the integral 
. 
{ ptxy GE) gy, 
dzk 


Integrating it & times by parts we obtain the formula 


Fearn fee IP wt PLDT 


2, 
+ (— ik: yk | elxy} (z) dr. 
~2 
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Making in the equation obtained 4 approach co and considering that by (10.142) 
all substitutions vanish we get 


| ety S12) dz=(—iy)* efxy.f (zx) dz —=(—iy)k-f (y). 


Considering that by Lemma 4 the integral on the left of the last equation tends 
to zero as | y | -+ co we obtain the estimate (10.113). Thus the lemma is proved. 

Theorem 10.20.\Let a function f (x) and its second derivative be absolutely integra- 
ble on an infinite line {—oo, 00), the function f (z) itself and its first derivative 
tending to zero as |z | —- cc. Also let a function g (x) be absolutely integrable on 
(—cc, 00). Then the following equation ts true: 


| f@)e@) de=Z | Fae way (40.414) 


called the generalized Parseval formula or Plancherel* equation. (In (10.114) 
f (y) and g (y) are the Fourier transforms of f (z) and g (zx) respectively and g* (y) 
isa complex conjugate of g {y).) 

. By virtue of Theorem 10.19 at every point z 


Proo 
f (=a [ e~ixvf (y) dy, (10.445) 


the estimate | f(y) | < C (4 + | y $)~? being true by Lemma 6, ensuring that 
the integral on the right of (10.145) converges absolutely and uniformly (with 
respect to z) on the whole infinite line. 

Multiplying both sides of (10.115) by g (z) and integrating with respect to x 
between —? and 2% we have 


f (z) g (2) ont | g (z) [ ( e~Exvf (y) ay | d. (40.146). 
—4, —co 


1250 
~s, | 


As pointed out above the integral (10.415) converges uniformly in z and so we 
may change the order of integration with respect to x and y on the right of 
(10.116) to get 


2, co 2 
| r@e@az—a | [| ee eae] Fey (10.417) 
~?, —~o —) 


(* stands for complex conjugation). 
By virtue of the inequality 


4, 
, 4 
in etx¥g (2) az | 


and the Weierstrass test the integral on the right of (10.117) converges uniformly 
with respect to 7on the infinite line —co < 2 < oo. In (10.117) therefore we 
can proceed to the limit as 7, o, while proceeding on the right of (10.417) 
to the limit under the integral. Thus the theorem is proved. 


* M. Plancherel (b. 1885) is a French mathematician. 


APMIS { lg (z) f-dz-cC (4+ y I)? 
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10.7, MULTIPLE FOURIER TRIGONOMETRIC SERIES 
AND INTEGRALS 


10.7.4. A multiple Fourier trigonometric Series and its rectangular 
and spherical partial sums. Let a function of V variables f (z,, 2.,... 
..-, Zy) be defined and integrable in an MN-dimensional cube 
—n<z,aQn (k =1, 2, ..., NV). We shall denote that cube by 
I]. It is convenient to write the multiple trigonometric serics of 
such a function straight in complex form, using the concept of 
a scalar product of two V-dimensional vectors to abridge the nota- 
tion. 

Let x = (2), Zo, ..., Zy) be a vector with arbitrary real-valued 
coordinates 2, to, ..., Zy, and Jet n= (n,, ne, ..., ns) be 
a vector with integral coordinates n,, M., ..., Ny. 

Lhe multiple Fourier trigonometric series of a function f (x) = 
= f (1%, Le, .. +, Fy) iS a series of the form 


oT ~ 7 
Noe NM faeen on), (10.118) 
Ty s=— 00 1 y-==—00 


where the numbers hn called Fourier coefficients are defined by the 
equations 


fa=fayn ty = (22) [ eee l fan. eoey Yn) x 
“on ° 


Kx emt FYNTN) dy. dyn, (10.419) 


and (#n) denotes a scalar product of vectors x and n equal to z,n, + 
+... + IyNy. 

Of course a multiple Fourier trigonometric series (10.1418) may 
be regarded as a Fourier series with respect to an orthonormal system 
(in an -dimensional cube II)* formed using all possible products 
of elements of the one-dimensional trigonometric system in variables 
Ly, To, -- +, Ly respectively. It is customary to call that orthonormal] 
system a multiple trigonometric system. 

For this, as for any orthonormal system, there is a Bessel inequality 
which is of the form 


oO oo 


See De lhe (... | Pe ety) dey... den, 


n)==—co Ny II 
(10.420) 
where f (z,, ..., Zy) is any funclion continuous in an N-dimen- 
sional cube II. 


* A scalar product of any two functions is defined to be an integral over 1 
of the product of those functions. 
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Consider the problem of convergence of a multiple Fourier trig- 
onometric series. [f that series does not converge at a given point 
Z= (yy 2. ey Ly) Absolutely, then its convergence (by Thoorem 13.10 
(Riemann) of {1]) depends on the sequence of its terms (or equiv- 
alently on the order of summation over the indices m,, Me, ..., My). 

Of wide uso are two methods of summing a multiple Fourier trig- 
onometric series, the spherical and the reclangular one. 

The spherical partial sums of a multiple Fourier trigonometric 
series an are sums of the form 


5, (2, f= sin enon) 
Inte 


taken over all integral values n,, Mae woo, Ny Satisfying the con- 
dition [a [= Vni + ni +o <A. 
A multiple Fourter trigonometric series (10.4118) is said to be si:m- 


mable at a given point x by the spherical method if there is a limit 
lim S A (x, f) at that point. 


The rectangular partial sums of a multiple Fourier trigonometric 
series (10.418) are sums of the form 


ei wg 
Sime. .mtyy (x, fp= ™ ove » fre ten, 
; Yhycs—~ rng Nap et— I y- 


A multiple Fourier trigonometric series (10.118) is said to be sum- 
moble at a given point x by the rectangular (or Princegeime) method 
if there is a limit 


lim Smet, oo Mt pe (zx, f) 


My oc ’ 
Mm, 09 


ee Fae 


at that point (as cach. of the indices m,, Mz, ..., My lends to infinity). 

The summation methods both have their advantages and draw- 
backs. In considering a multiple Fourier trigonometric series with 
respect to an orthonormal] system it is natural to arrange its terms 
in the order of increasing { n | and deal with spherical partial surns. 

Rectangular partial sums are used in investigating the behaviour 
of multiple power series near the boundary of the domain of conver- 
gence. It should be noted that the definition of the sum of a series 
ag the limit of rectangular sums (in contrast to the definition based 
on the limit of spherical sums) imposes no constraints on the infinite 
sel of partial sums -of that kind. 

Before formulating the conditions for the convergence of a multiple 
Fourier trigonometric series we define some smoothness characteris- 
tics of a function of N variables. 


23—01664 
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10.7.2. The modulus of continuity and Holder classes for a function 
of NV variables. Let a function 7 (x) = f (2, z. ....Zy) be dehned 
and continuous in an -dimensional domain D. 

Definition 1. For every § > 0, the modulus of continuity of a fune- 
tion f(x) in a domain D is the supremum of the absolute value of the 
difference | jf (x') — jf (x") | on the set of all points x’ and x" in D 
whose distance Q(z’, x") from each other is less than &. 

We shall denote the modulus of continuity of f (z) inD by w (6, jf). 

Definition 2. For any ~ of the half-open interval O<O 2x <1, 
we Shall say that a function f(x) belongs in a domain D to a Holder 
class C* with an index ~ and write f(z) € C“(D) if the modulus of 
continuity of f (x) in D is of order w (6, f) = 0 (8*) trhenO<z<cl 
and w (6, f) = O (8) when wv = 1. 

Now let a@ be any (not necessarily whole) positive number. We can 
always represent it as @ =r-—+ x, where r is an integer and ~ is in 
the half-open interval O< zx <1. 

Definition 3. We shall say that a function j (x) belongs in a de- 
main D to a Holder class C* with an index a > 0 and write f (r)€ 
€ C°(D) if all partial derivatives of f (2) of order r are continuous in D 
and every partial derivative of order r belongs to the class C® (D) intro- 
duced in Definition 2. 

10.7.3. Conditions for convergence of a multiple Fourier trigono- 
metric Series. We first establish the simplest conditions for absolute 
and uniform convergence of a multiple Fourier trigonometric series. 

Theorem 10.21. If a junction f (x) is periodically (with period 
2x in each of the variables) extended to the whole of a space E* and 
has in E*® continuous derivatives of order s = {N/2) + 4, where {N/2} 
is the integral part of the number N/2, then the multiple Fourier trig- 
onometric series of f (x) converges (to that function) absolutely and 


uniformly in the whole of EX®. 
“ 
| the Fourier cocfh- 
=p Jn 


with index n=(n,,...,Nx)e Integrit- 


Proof. Let us agree to denote by ( 


Omf 


€ zi. 


i 


Or, 


cient of a derivative 


ing by parts we get ( 
y 
that | Ss (=o) |=. f (jmp Jt... + fm.) and therefore 


} =in,f, (for any k=1,2,...,N), 50 
ee 


ho C=} 
fim(=(Im f+... thax! DS (Zo) J. (10.121) 
ke} ; 


Formula (10.121) is true not only for the function f but also for every 
partial derivative of f up to the order (s — 1) inclusively. 
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This immediately yields the relation 


(fa [<([my | +... +] ey 17° yy \(—“..),| 
rt... Ory 


Sit . -FSpyyp=5 


(10.422) 


the sum on the right of which is taken over all nonnegative integers 
$1, So, » + +) Sy Satisfying s, + sp +... + Sy = s(so that the num- 
ber of terms in the sum is equal to NV*). From (10.422) it follows 
in turn that* 


Lin <a [Eee ED ty IE 


a> |(—“L_), 


5} 
Sree tS Oz, vee N 


Considering that sat te, where e=1 for an even NV and 


c= 5 for an odd NW and that 


([nyJto.. +] ny [7 %=([my | +... +) ay [yor vr< 


—{ 42k 
Slr, | Nye. [ry] NY 
we get from (40.123) 
a 4 4 oe 1-3 
lin ISH Ms | Nv i.e [ny | 4+ 
Ns on ; 
ve 2: ( s a = aI ° (10.124) 
Site. tS ys 0x} eee Or 


For the absolute and uniform convergence of a multiple Fourier 
trigonometric series (10.1418) it suffices (by virtue of the Weierstrass 
test) to prove the convergence of the number series 


oo co 


DY eee DD fel 


ny=— oo N pF 00 


Iajorizing it, but (by inequality (10.124)) the convergence of this 
Jast series is a direct consequence of the convergence of the. num- 


2 2 
* Wo use the inequalities {a{-{ | <T4+> and ({a,[-+...+)ap I)? 


<p(aj+...-+ 45). 
23% 
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oC Qr 
. % ~{--—— 
ber series “jn, | % for any & and the convergence of the 
Ty, == ~ OC 
series 
os 2 ~ 
yy > |(——1— l 
Ce ee) x oy n 
Pe Tl yet OC dx; eee OI» 


for any Sy, Sq,+--,Sy that follows from the Besse] inequality 


g 
(10.120) for the continuous function a —. 
az? eee dx ¥ 

The fact that it is to the function f (x) that the multiple Fourier 
trigonometric series (10.148) converges follows from the completc- 
ness of a multiple trigonometric system*. Indeed, if tho series 
(10.418) uniformly converged to some function g (x), then from 
the possibility of 1erm-by-term integration of such a series it would 
follow that all Fourier coefficients of g (z) coincided with the cor- 
responding Fourier coefficients of f(z). But then the difference 
(f(z) — g(x)} would be orthogonal to all the elements of tho 
multiple trigonometric system and (by the completeness of the 
system) equal to zero. Thus the theorem is proved. 

Remark 1. Theorem 10.21 may be revised. The following state- 
ment is true**: if a function f (x) is periodic in each of the variables 
(with period 2x) and belongsin a space EN toa Hélder class C*%, with 
a >> N/2, then the multiple Fourier trigonometric series of f (a) con- 
verges (to that function) absolutely and uniformly in the whole space E%. 

Determining conditions for nonabsolute convergence of a multiple 
trigonometric series requires a finer technique. 


We shall state withoul proof the conditions for the summability of a mul- 
tiple Fourier trigonometric series by the spherical and the rectangular method. 
Theorem 10.22. If a function f (ry, .. +) Tx) of N > 2 variables is periodic in 


each of the variables (with period 2x) and belongs in a space E™ toa Hélder class C%, 
with @ D> (N — 1)/2, then the spherical partial sums of the multiple Fourier (riga- 
nometric series of { (21, .~ «, Ty) converge to that function uniformly in the whole 


space E Nan, 


* The completeness of the multiple trigonometric system follows immediately 
from the completeness of the constituent one-dimensional trigonometric systems 
the product of which it is. 

** This statement is very simply obtained from Lemma 3.1 proved in the 
paper by V.A. flyin and Sh.A. Alimov. "Conditions for the converrence of 
spectral decompositions that correspond to self-adjoint extensions of elliptic 
operators, I" (Differentsialnye Uravneniya, Vol. 7, No. 4 (1974), pp. 670-710). 

**7 This theorem follows from the more general statements proved in the 
paper “Problems of Jocalization and convergence of Fourier scries in fundamental 
systems of functions of the Laplacian operator” by V.A. Hyin (Uspekhi Matema- 
ticheskikh Nauk, Vol. 23, 2 (1968), pp. 61-120) and in the paper cited in the 
preceding footnote. 
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Theorem 10.23. For any positive a less than (N—1)/2 and any point x of an 
N-dimensional cube II there is a function f (21, Yay.» +) Fy) Of N > 2 variables 
periodic in each of the variables (with period 2), belonging in EY to a class c™, 
vanishing in some §-neighbourhood of 2) and such that the spherical partial sums of 
the multiple Fourier trigonometric series of that function have no limit at x9*, 

Theorems 10.22 and 10.23 establish final conditions (in Hélder classes C%) 
for the convergence of the spherical partial sums of a periodic function f (z,,.. - 
. ++; Zy). According to those theorems there is uniform convergence of spherical 
partial sums when a > (N — 1{)/2, but when @ < (V — 1)/2 even the principle 
of localization fails to hold for them (however smooth a function f may be in 
a neighbourhood of 2, its belonging to a class C% (E*) when a < (N — 4)/2 
does not ensure that its spherical partial sums converge at 2»). 

Final conditions (in Holder classes C™) for the convergence of the rectangular 
partial sums of a multiple Fourier trigonometric series have been established by 
L.V. Zhizhiashvili**. 

Theorem 10.24. If a function f (xy, . . +; Ly) of N variables is periodic in each 
of them (with period 2n) and belongs in E** to a class C™, with any a > 0, then 
the rectangular partial sums of the multiple Fourier trigonometric series of 
f (%, . . ., Ly) converge (to that function) uniformly tn EN, 

Remark 2, Note that as far back as 1928 L. Tonelli*** established that the 
continuity alone of a function f (x, ..., 2y) of N > 2 variables fails to ensure 
not only the uniform convergence but also the principle of localization of the 
rectangular partial sums of the multiple Fourier trigonometric series of the 
function (there is a function periodic in each of the variables (with period 2m), 
continuous in EN, vanishing in some 5-neighbourhood of a given point zy and 
such that the rectangular partial sums of that function diverge at zp). 


10.7.4. On the expansion of a function into an V-fold multiple Four- 
ier integral. Let a function of N > 2 variables, f (%, ..., ty) = 
= f(x), allow the existence of an improper integral 


{ = \ f(z see, Dy) ry... AEy. (40.125) 
EN 


The Fourier transform of such a function is the quantity 


(Yar ees Yn) =f (y) = \ cee el?) F (24, 164, Ly) dr, ... dry. 
EN 
In close analogy with Lemma 4 it can be proved that f (y) is 
a continuous function of y everywhere in EH" and tends to zero as 


lyj=Vyet... + yy oo. 
The limit 


lim |... \ F(Uss veer Vw) enTH™ VI dy, 0. Gy (10.426) 


Aw * yc 


* This theorem is a special case of a more general statement proved _in 
Chapter 3 of the paper by V.A. Ilyin cited in the preceding footnote. 
** “On Conjugate Functions and Trigonometric Series” by L.V. Zhizhiashvili. 
Thesis for a Doctor’s degree. Moscow: MGU (1967). 
«** T,, Tonelli (1885-1946) is an Italian mathematician. 
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(provided that this limit exists) is called the erpansion of the function 
f(x) into an N-fold multiple Fourier integral. 


The following two statements are true*. 

1°. If a function of N > 2 variables, f (x, . .., ry), vanishes outside some 
bounded domain and belongs in the whole of a space E*® to a Holder class C*, 
with @ > (N — 1)/2, then the expansion of that function into an N-fold mul- 
tiple Fourier integral (10.126) converges (to that function) uniformly in the 
whole space E%, . 

2°. For any positive a less than (N’ — 1)/2 and any point 2 there is a function 
of N > 2 variables, { (21, . . ., Zy), different from zero only in a bounded domain, 
belonging in E* to a class C%, vanishing in some 5-neighbourhood of zp and such 
that for that function the limit (10.126) docs not exist at the point x9. 

Statements 1° and 2° establish final conditions (in Hélder classes ce} for the 
convergence of the expansion into an N-fold multiple Fourier integral of any 
function equal to zero outside some bounded domain of a space E*. According 
to those statements there is a uniform convergence (in any bounded domain) of 
the expansion into an N-fold multiple Fourier integral when @ > (N — 1)/2, 
but when «a < (N — 1)}/2 even the principle of localization fails to hold for the 
expansion into an N-fold multiple Fouricr integral (however smooth a function f 


may be in a neighbourhood of a point 2p, its belonging in the whole of E* to 


a class C%, with a@ -< (N — 1)/2, does not ensure that the expansion of that 
function into an N-fold multiple Fourier integral converges at z). 


* Both statements follow from the more general ones proved in the paper by 
Sh.A. Alimov and V.A. Ilyin “Conditions for the convergence of spectral de- 
compositions that correspond to self-adjoint extensions of elliptic operators, II” 
(Differentsialnye Uravneniya, Vol. 7, No. 5 (1971), pp. 851-882). 


CHAPTER 11 


HILBERT SPACE 


In this chapter we study an important subclass of infinite-dimen- 
sional Euclidean spaces, the so-called Hilbert spaces. 

We establish a special representation, important for applications, 
of any linear function of the elements of such a space (this function 
is generally called a linear functional), we also establish that it is 
possible to choose a subsequence, convergent in some weaker sense, of 
any infinite set of elements of a Hilbert space, bounded in the norm 
{this property is termed a weak compactness of a ball in a Hilbert 
space). 

Particular attention is given to the study of orthonormal systems 
of elements of a Hilbert space. We establish the equivalence for 
such systems of the concepts of closure and completeness intro- 
duced in Section 10.2. and prove the famous Riesz-Fisher theorem 
according to which any sequence of numbers, the series of whose 
squares converges, is a sequence of Fourier coefficients of some ele- 
ment of a Hilbert space in an expansion with respect to a preassigned 
orthonormal system of elements of that space. In the last section 
we prove that the so-called completely continuous self-adjoint oper- 
ators acting in Hilbert space have eigenvalues. 


41.1. THE SPACE ]2 


11.1.4. The space 17. Consider a set whose elements are all possible 
sequences of real numbers (2, 22, ..-, Zp, -..) Such that the 
series made up of the squares of those numbers 


3} af (14.4) 
koi 


is convergent. The elements of such a set will be denoted (as vectors) 
by som bold Latin letters: #= (2,, Zo). - 5 Spy - + +)y Y= (Yar Yours 
- ...), and so on. The numbers 21, 7%, ..., Tay ... will 
be called the coordinates of an element « = (21, Lo, .. .). 
We define the operations of addition of elements and of multipli. 
cation of elements by real numbers. A sum of two elerrents x = 
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= (2%), Toys evy Tny ss +) and y = (Yi. Yor -- ++ Yay - + -) is an ele- 
ment 2 == (2) Yy, By Yay - ee) Tn HYUn,-.-)*. Tt will be desig- 
natedz =a-+ y. A product ofan element x = (ry, Bay seed My ---) 
by a real number 4 is an element designated Ax or xi and equal to 
(1.25, AZo, . «+, ATn,.~.). It is easy to verify that the set we have 
defined is a linear space, i.e. to verify that all the axioms relating 
to the addition of elements and to the multiplication of elements 
by real numbers are true** 

We now introduce in that set a scalar product of any two clements 
@ == (Zy, Lo, »--) In» ...) and y = (Yy, Yo)... Yn» . ~~), defining 
it to be the sum of a series*** 

oo 
»! THY ke 
k=! 

oo 


So we set (7, y) = Ss Zny;,. Itis easy to verify that all the four 
=! 


axioms of a scalar product are true. (They can be found in Section 
10.4, and verification of their validity for the space under study 
will be left to the reader.) 

Thus the set we have introduced is a Euclidean space. Following 
the established tradition, it will be designated 1*. 

We introduce in 2*, as in any Euclidean space, the norm of every 
element x = (2, Z, ---, Lpy -- .), Setting it equal to 


lo ||=V@, 2) = Vs rhe (41.2) 


(Since the series (11.1) is convergent, such a definition is meaningful). 
As usual, two elements of /? are said to be orthogonal if their 
scalar product is equal to zero. 
Recall that an orthonormal system in a Euclidean space is a sequence 
of elements {e,} of that space satisfying two requirements: (1) any 


foe) 
* The convergence of the scries » (x, + yp,)* follows immediately from 
h=1 


fa @) 
the inequality (za-+ yn)? <22}-+ 2y? and from the convergence of \) zf 
hes 
oo 
and 2 Vie 


“ The formulation of the axioms ofalinear space can he found in any 
course of linear algebra. 
*** The convergence of this series follows from the inequality lrnun |X 


<> z z2 ty?) and from the convergence of 5! zp and > Uj. 
k=4 ke J 
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two elements of the sequence should be orthogonal; (2) the norm of 
each element should be equal to unity. 

We prove that there is a closed (and therefore, by Theorem 10.7, 
complete) orthonormal system in /**. We show that such a system 
is a sequence of elements 


é,= (4, 0, 0, eee, 0, wse)s 
en=(0, 1, 0, «+e, 0, oe), 
an 0, 1, ene, 0, weedy 


(41.3) 


That this system is orthonormal is obvious (the norm (14.2) for 
every element e; is equal to unity and the scalar product of any two 
elements is an infinite sum of products each of which is equal to 
zero). To prove the closure of the orthonormal system (11.3) it suf- 
fices to establish that for any element « = (a, Do, .-.; Tn, -- -) 
of 22 its Fourier series with respect to the system (411.3) converges 
to it in the norm of /?**, 

Since the Fourier coefficients (x, e,) of an element zx coincide 
with its coordinates z;, the nth partial sum of its Fourier series is 

Tl 


equal to S) z,e, and it is sufficient for us to prove that 
h=1 


lim || >, 1,p— & || = (11.4) 


N-~co 


But from the definition of the norm (41.2), from the orthonormality 
of asystem {e,} and from the properties of a scalar product it follows 


that 


nm n , n 
2 ne—olP=( > Ty€,p—Z, ; re.) = 
h=4 k=i 


Tt 
a 2) an (Ons 2) + Ux IP= ila |e Six 


k=i 

= ¢ oa 

1 e > 9 
=) ai— D>) th= 2 Zh, 

k=l k=} =n-+1 


so that the relation (11.4) follows from the convergence of the series 


4 
( VDL. 2. The general form of a linear functional in l*. We shall 
discuss functions whose independent variables are elements of —[* 


pe 

* For the definitions of completeness and closure of an orthonormal system 
cee Section 10.2. 
~'*#* For then any element + of 1* can be approximated to an unlimited precision 


in the norm of / by partial sums of that Fourier series. 
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and whose values are real numbers. Such functions are usually called 
functional (defined in a space [*). 
More precisely, our aim is to make a detailed study of the simplest 
functional defined in 2[, the so-called Jinear functional. 
Definition 1. A functional 1 (x) defined in a space [° is said to be 
linear if for any elements x and y of l= and any real numbers a and B 


L(ax -+- By) = al (x) + Bl (y). 


Let x) be an arbitrary element of 7°. To geometrize the termi- 
nology we shall often call the clement zx, a point of /*. 

Definition 2. An arbitrary functional I(x) defined in F° is said 
to be continuous al a point x, of 1* if for any sequence of elements {x,} 
of [* converging in the norm of I° to the element x, the number sequence 
1 (x,) converges to l (Zp). 

Definition 3. A functional 1 (x) is said lo be continuous if if is 
continuous at each point x of lL’. 

Notice at once that in the case of a linear functional I(x) continuity 
at least at one point x, implies continuity at each point x of I*. Indeed, 
let a linear functional be continuous at a point x» and let x be an 
arbitrary point of /*. Denote by {z,} an arbitrary sequence of cle- 
ments of /* converging in the norm of /? to 2. Then the sequence {a+ 
+ x, — x} converges in the norm of [* to x, and from the conti- 
nuity of the functional at z, it follows that 


1 (xy + 2, — xz) +1 (@) when n—->oo. (411.5) 


But from the linearity of the functional it follows that 1 (a + 
+a, —2) = 1(x2) + 1 (x,) — 1 (x). From the last equation and 
from (11.5) we get 2 (2,) +1 (x) as m — 00, which just means that 
the functional is continuous at z. 

Definition 4. A functional | (x) is said to be bounded if there is 
a constant C such that for all elements x of I° 


(1 (x) |<C Ia fl. (11.6) 


Theorem 11.1. For a linear functional l (x) to be continuous it is 
necessary and sufficient that it should be bounded. 

Proof. (1) Necessity. Let a linear functional J(a@) be continu- 
ous. Suppose there is no constant C satisfying inequality (11.6). 
Then there is a sequence of nonzero elements x7, such that |/(z,)|> 


= n* || 2p it set Yn = 
— n|}Zp Il 
as moo, by the continuity of the functional (y,)— 1 (0) =0 as 


° 1 
En. Since ayn —Ofl=llyn ||=—— 0 


* Tor a zero element 0 inequality (11.6) is true for any constant C, for by 
the linearity of the functional 7 (0) = 1 (Or) = 0-1 (x) = 0. 
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1 


n-+ oo, but this contradicts the inequality | (gn) |= 737 lanl 
n 


x 


~|l(z,) =n. Thus the necessity is proved. 

(2) Sufficiency. Let a linear functional 1 (2) be bounded, i.e. let 
there be a constant C such that for all elements x inequality (11.6) 
is true. Also let 29 be an arbitrary point of J* and let {z,} be an 
arbitrary sequence of elements of /* converging in the norm of ?? 
to Z. Then by the linearity of the functional / (2,) — / (@%) = 
= 1 (2, — 24), so that on the basis of (11.6) | 2 (@,) — L (a) | = 
= | 1 (a, — a) | <C || a, — xol|. From the last inequality it fol- 
lows that J (a,) —-1 (a) as n oo. Thus the sufficiency is proved. 

The theorem proved allows us to introduce the norm of a linear 
continuous functional. 


td 


Definition 5. The norm of a linear continuous functional | (x) 


is the supremum of a relation 5 = | on the set of all elements x 
of a space 1?. 
It will be designated || Z |j. 
So by definition 
[L(x 
l = 41.7 
haere ea ae 


The following main theorem is true. 

Theorem 11.2 (Riesz’s theorem). For every linear continuous 
functional 1 (a) there is one and only one element a of I? such that for 
all elements x of I? 


l (x) = (a, 2), (14.8) 
with || 1 i} = {ha IL. 
Proof. Let {e,} be a closed orthonormal system (141.3), a,= 


=1U(e,) (k=41, 2,...). We show that the sequence of real num- 
bers (a;, @, ...,; @n) is an element of /?, i.e. show that the series 
>) af converges. 
k=t 
n 
For any n, set Sy= >) dpep- 
k= 


Then by the linearity of the functional 


L(Sn)== 2; apt (en) = 2) ak = |] Sn li? (14.9) 


On the other hand, from Theorem 14.1 and from the definition 
of the norm of a linear continuous functional (411.7) it follows that 


(2S, PSHE UMS, He (14.40) 
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From (141.9) and (11.10) we get |] S, [|< |} 2 |] or equivalently 


pe 
 at<IUIP- (14.41) 
The last inequality true for any » proves that the series >) af con- 
heat 


verges, i.e. that the sequence (a,, @o,..., Gn; .-.) iS some element 
of J? which we denote by a. 

Now let a = (2, Yo, ..-; Zp, .. -) be an arbitrary element of /’. 
Then by the closure of an orthonormal system (11.3) a partial sum 


nn 
of a Fourier series > x,,e; converges in the norm of [*to ras n >. 


ho 
By the continuity of the functional it follows that 
wt 
L( >* 2,e,) l(a) as n—> co. 
h=1 


But from the linearity of the functional and from the equation 
ay, = l (e;) we get 
n n nN 
l (2 tpe,) = > pl (en) = 2, LpAp. 


——- — 


So we have proved that 
n 
lim \ 


n-oo h=1 


LRay, = l (x), 


but this means that we have established equation (11.8) with a 
uniquely determined element a whose coordinates are equal to /(e;). 


It remains to show that |] 2 || = || @ ||. It is|immediate from in- 
equality (41.141) true for any 7 that 
Hea ll< UZ il. (44.42) 


On the other hand, from equation (14.8) we have already proved 
and from the Cauchy-Buniakowski inequality” | (a, z)| < }{ a]j-|Jz]| 
we get |Z (zx) | < Jl a |l- || z |] from which by the definition of the 
norm (141.7) it follows that 


Zi<ell | (11.43) 


From (14.12) and (11.13) we conclude that |] / |} = ]la@ |]. Thus 
the proof of the theorem is complete. 
The theorem establishes a general form of any linear continuous 


Functional in 7 


* By Theorem 10.4 the Cauchy-Buniakowski inequality is true for any two 
elements of any Euclidean space. 
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11.1.3. On the weak compactness of a set bounded in the norm of /?. 

Definition I. A set E of elements of I? is said to be bounded (or 
bounded in the norm) if there is a constant M of elements such that 
la l]< M for all elements x of E. 

Definition 2. An infinite set E of elements of l* is said to be com- 
pact if we can choose a subsequence {p,} of any sequence of elements 
{x,) of E converging in the norm of -. 

It is obvious that any compact set E of [ elements is bounded*. 

In a finite-dimensional Euclidean space the converse is also true: 
any bounded set E containing an infinite number of elements is com- 
pact (the Bolzano-Weirstrass theorem). But in an infinite-dimensional 
space, such as J°, the boundedness of an infinite set of elements of £ 
no longer implies its compactness. 

For instance, the set {e,} of all the elements of an orthonormal 
system (11.3) is bounded (for the norms of all elements are equal 
to unity), but it is not compact (since for a sequence of elements 
to converge in the norm of /* it is necessary that the norm of the 
difference of two elements with integers k and k + 1 should con- 
verge to zero as k —~0o, and for any subsequence of the elements 
of (44.3) |] en — er I? = ll en I? + ll er I? = 2 given any & and / 
not equal to each other). 

It is natural to try to introduce the concept of compactness of 
a set in a weaker sense (than in Definition 2) so that any bounded 
set (containing an infinite number of elements) should turn out to 
be compact in that weak sense. 

Definition 3.%A sequence {x,} of elements of a space 1° is said to 
converge weakly to an element x, of that space if for any element 
a of 7? 


(t,, @) —>-(%), @) aS N—->OW. 


Notice that the convergence of {z,} to x, in the norm of /? and 
the Cauchy-Buniakowski inequality imply the weak convergence ol 
{tn} 10. %, since | (#,, @) — (%, @) |= |r —% als 
<V Ix, — x {f- {| @ |] for any element a. The weak convergence of 
{x,} to 2 does not in general imply the convergence of {x,} to x 
in the norm of /*. For instance, the sequence {e,,} of all the elements 
of an orthonormal] system (11.3) converges weakly to a zero element 
0, becausegfor any element aof /* we have the Bessel inequality** 


end i 
» (en, a)? <*|] @ | according to which (e,, a) > (0, a) = 0 as 
k=i ' 


* Indeed, the boundedness of # would imply the existence of a sequence of 
elements of E for which the sequence of norms is infinitely large. Any sub- 
sequence of such a sequence diverges in the norm of J°, which contradicts the 
compactness condition of Z. 

** By Theorem 10.4 the Besse) inequality is true for every element and any 
orthonormal system in an arbitrary Euclidean space. 
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n-—»oco. Moreover, it has been proved above that {e,} does not 
converge in the norm of /. 

Convergence in the norm of /* (in contrast to weak convergence) 
is often called sirong convergence. 

Definition 4. An infinite set £ of elements of I- is said to be weakly 
compact if we can choose a weakly convergent subsequence of any sequence 
of elements {x,} of E. 

The following fundamental theorem is true. 

Theorem 11.8. Any bounded set of an infinite number of elements 
in 2° is weakly compact. 

Proof. Let & be an arbitrary bounded subset of F° containing 
an infinite number of elements and let {z,} be an arbitrary sequence 
of elements of &. The condition of boundedness of / allows us to 
state that || a, || << 47, _Where Mois some constant. But then it 


follows from || 2, ||* = 5 zn that for any & the number sequence 
h=1 


of the éth coordinates z,), of the elements z, is bounded. Therefore, 
by the Bolzano-Weierstrass theorem (see Theorem 3.3 in (1]) we can 
choose from {x,} a subsequence of elements {2} such that the 
first coordinates of those elements form a convergent number sequence 
and then we can choose from {z}!’} a subsequence of elements {z;;'} 
such that both the first and the second coordinates of those elements 
form convergent number sequences and so on. After / steps we choose 
a subsequence of elements {2} in which each of the first / coordi- 
nates form a convergent number sequence. 

Set yn = xy”. It is obvious that {y,}is a subsequence of the orig- 
inal sequence of elements {z,} and that the sequence formed by 
any coordinate of the elements y, is a convergent number sequence, 
icc. if yn = (Ynis Yner - ++) Ynh> - + -) then for every / the sequence 
of Yn, CONVerges as n — oo. We denote by &, the limit of a sequence 
of the Ath coordinates of elements y,, i.c. set E, = lim yp), (kK = 1, 


2, ...) and show that the sequence (Ex, € Eny » + ey En, es ~) IS SOME 


element of J*, i.e. show that the series y &}, converges. Since |] yp I] < 


= 


< A for every 7, we have for every 7 


~ S yan <A? (14.44) 


ro | 


and clearly 


N 
SM y3n< M2 (14.15) 
i | 


(for any fixed N and for every n). 
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N 
Proceeding in (11.15) to the limit asm—> co we get >’ ER< a2 


for any NV, and this means that the sequence (&, &, ..., &, ...) 
is some element of J? which we denote by & 

It remains to prove that {y,} weakly converges to that ele- 
ment i.e. that for any element @ = (a4, do, ..+, Gp, .-.) of 
lim (y;, @) = (§, @) or equivalently 


NO 

ras a 
lim “Yanan = oy Epp. 
noo ond k= 


By virtue of the fact that lim y,, = &, and by the theorem on pas- 


N+ OO 
sage to the limit term by term (see Theorem 1.6) it suffices to prove 
that the series 


oO 


» Ynk* (11.16) 
h=4 
converges uniformly with respect to every nm. Fix an arbitrary 


e>0. The convergence of >} a? implies the existence of mg, such 
k=1 
that 


m+p 5 

YY aS (14.17) 
k=m-+ 1 
for every ml=m, and every natural p (p=1, 2, ...). 


Applying to the remainder of the series (11.16) the Cauchy-Bunia- 
kowski inequality for the sums* 


m+p mm m+P 1/2 
| » UnrGp I< [> LJ Ynk » aj,\ 
h=m-+1 h=m+i  - h=m-+4 


and using inequalities (11.14) and (14.17) we get 


m+p 

| >) Unk@p | <e 

h=m-+1 
for every m >> mp, every natural p and for every 7 at once. But this 
means that (441.16) converges uniformly with respect to every n. 
Thus the theorem is proved. 

It has numerous applications. In particular, it is widely used 
in the theory of variational methods of solving problems in mathem- 
atical physics. 


* This inequality has been established in Supplement 1 to Chapter 10 of [4]. 
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11.2. THE SPACE [2 


11.2.4. The simplest properties of the space L*. We are already 
familiar with the space Z* from the study of classes LD’, with p > 1, 
in Section 8.4.7. 

Recall that a space L* (Z) is the set of all functions {f (x)} such 
that every function f (z) is measurablo on a sot £ and overy function 
f (z) is summable (i.c. integrable in the Lebesgue senso) on L. 
We do not distinguish between functions equivalent on /, consider- 
ing them to be a single element of L* (2). 

L*(Z) is briefly called a space of functions with a square summable 
(on a set £). 

Note at once that all the integrals in this section are understood 
in the Lebesgue sense and that a sot / is understood to bo a measur- 
able set of positive finite measure on an infinite line, although the 
entire theory to be presented could be extended without any compli- 
cations to include the case of an arbitrary set of positive measure 
Ein a space of any number rn of dimensions. 

It was established in Section 8.4.7 that a space J* (/) is a nor- 
malized linear space with the norm of any element f(z) of the form 


1/2 


nril=(\ P@az)™. (11.48) 
Bb 


The space L* (2) differs substantially from all other spaces L? (7), 
with p = 2, in that L* (2) isa Euclidean space with a scalar product 
of any two elements f (z)*and g (x) of the form* 


(f, a= | f(x) ¢ (2) de. (11.49) 
i 

The validity in Z°(2) of all the four axioms of a scalar product** 
easily follows from the independence of the product f (x) g (z) 
of the order of the factors, from the linear properties of the integral] 
and from the equivalence to zero of a measurable, summahle and 
nonnegative function f* (z). 

Also notice that from (41.18) and (44.19) it follows that in LZ? 
(as in any Euclidean space) the norm and scalar product are related 
by 


KA = VES). 


* For the definition of a Euclidean space and a scalar product sec Section 144. 
**« The axioms of a scalar product can be found in Section 10.1. 
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Finally, recall that it has been proved in Section 8.4.7 that the 
space L? (Z) is complete”. 

We now proceed to show the more profound properties of L* (£). 

11.2.2. The separability of the space Z*. First consider an arbitrary 
normalized linear space R. 

Definition 1. A set M of elements of a normalized linear space R is 
said to be everywhere dense (or dense in A) if for any element f of R 
we can choose a sequence of elements {f,} of M converging in the norm 
of R to jf. 

Definition 2. A normalized linear space R is said to be separable 
if in it there is a dense set of elemenis M countable everywhere. 

Our aim here is to prove the separability of L’. 

Theorem 11.4. A set of functions continuous on E is everywhere 
dense in L? (£). 

Proof. Let f (z) be an arbitrary function in L? (#). We may assume 
without loss of generality that f(z) > 0. Introducing two nonneg- 
ative functions 


ft (=> (iF I+i(@), P@=5 (IF @ I—-f a), 


it is indeed easy to see that the theorem is true for any function 
f¢€L* provided it has been proved for nonnegative functions. 
In addition we may assume that f (x) takes on finite values every- 
where. So let f (x) CL°(£) and O<f (z)< ~. 
Consider for every 7 a sequence of disjoint sets** 


k= El oe <f(2)<7Z* | (=, 4, 2, ...). 
Then obviously for any n(m = 1, 2, ...) the sum of those sets 


over all k = 0, 1, ... yields a set FH, ie. HK = U Ek, 
h=0 


Construct a sequence {7, (z)} of functions defined on £ for every 
n, by setting f, (c) = k/2" for z that is in ER. Thus every function 
fn (z) is a “step”-function on # (taking on at most a countable number 


of values). 
It is also obvious that for every mn and every point z of £ 


O<f (2) —fa (ze) < 1/2", 


whence it follows that {f, (x)} converges to f (x) uniformly on £, 
Set W, (z) = min {x, fp (2)}. 


* Recall that a normalized linear space A is said to be complete if for any 
fundamental sequence {f,} of elements of that space (i.e. for a sequence {fn} 
for which lim sup ll fm — fn || = 0) there is an element f of R to which tha 

n-o mon 
sequence converges in f. 
** Recall that the symbol £ [} satisfies condition A] designates a set of all 
points of E for which f (z) satisfies condition A. 


24—01684 
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Every function 'f,, (z) takes on on & only a finite number of values, 
the sequence {¥, (z)} converging to f (xz) everywhere on FE. Since, 
in addition. 0< f(r) — V,, (7) << f(z) everywhere on F£, whence 
it follows that [7 (x) — VW, (z)F < F (x), by the corollary of Theo- 
rem 8.419 the sequence [7 (2) — WV, (x) converges to zero in £1 (F), 
i.e. the sequence of I, (xz) converges to f (x) in L* (£). 

Jt remains to prove that every function VY, ¢r) can be approx- 
imated in the norm of L°(£) by a continuous function to an 


unlimited precision. Recall that every function YU, (xz) takes on only 
mM 


a finite number of values. i.e. has the form Y, (z)= “ ap), (z), 
NOY 


where a) (k=1, 2,..., m) are constant) numbers and @, (z) are 
the so-called characteristic functions of the sets £): 


- 4 ona set &),. 
(dp (2) T= 1 
1, QO outside Fy. 


Thus, to complete the proof it suffices to construct a sequence of 
continuous functions converging in 4° (/) to a function w (z) of 
the form 


1 on Eo. 
o(2)=| 0 outside Z,. 


where /y is some measurable set in F. 

For a set £2, and for any n there are an open set. G, containing J, 
and a closed set /, contained in Z, such that the measure of the 
difference G, — F, is less than 4/n*. 


Denote by F the complement of G, and set 
p (Zz, Fn) 
* ' e 4 
O(z, Pn) p (zy, Pn) 
where p (z, /) stands for the distance from a point 7 to a set F, 
It is obvious that every function q, (7) is continuous on 7, equa) 


to unity on /,, equal to zero on F,, and satisles everywhere the 
condition 0 < m, (x) <1. Hence we obtain for the norm of the 
difference Gy, (z) — w (7) the following estimate: 


dex (11,20) 


i 


(0) = 


tn © Hie = \ [p,, (7) —o (2) Pers 


I Gn r 


which completes the proof. 
We now prove the following main theorem, 
Theorem 11.5. For any bounded measurable set FE. a space I (F) 


is separable. 


oT es e . ' 
* my the definition of the measurability of a set 2 and the corollary of 
Theorem 8.5 {see Section 8.2.2). 
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Proof. We first prove the case where £ is a closed interal {a, 5}. 
We prove that in this case we may take the set JJ of 211 polyno- 
mials with rational coefficients asa countable everywhere dense set 
in L? {a, bl*. 

According to Theorem 11.4 any function f(z) in L? (fa. L]) can 
be approximated in the norm of L? (la, b]) to an unlimited preci- 
sion by a continuous function. Further. according to Theorem 1.15 
(WwW eierstrass) any function continuous on fa, lj] can be approxi- 
mated uniformly on that interval (and therefore in the norm of 
L? ({a.b})) to an unlimited precision by an algebraic polynomial 
with real coefficients. 

Finally, it is obvious that an algebraic polynomial with rea] coef- 
ficients can be approximated uniformly on fa, b] and therefore in 
the norm of ZL? ({a. b]) to an unlimited precision by a polynomial 
with rational coefficients. This completes the proof for the case where 
E is a closed interval [a. b}. 

Now let & be an arbitrary bounded measurable set. Since £ is 
bounded, there is a closed interval fe, 4] containing E. 

Let 7 (z) be an arbitrary function in L* (£). Extend it to fa. 5] 
by setting it equal to zero outside £. It remains to notice that the 
function j (x) thus extended is in L? ([a, 6}) and therefore, by what 
has been proved above, can be approximated to an unlimited preci- 
sion in the norm of L? ({a, bf) (and the more so in the norm of L? (£)}) 
by polynomials with rational coefficients. In this case too. therefore, 
polynomials with rational coefficients form a set dense everywhere 
in L* (f). This completes the proof. 

41.2.3. The existence in L* of a closed orthonormal system con- 
taining a countable number of elements. To construct in L? a closed 
orthonormal system of elements we shal] assume the existence in 
I? of a countable, everywhere dense set of elements j,, i, ...- 

We shall prove that a closed orthonormal System can be construc- 
ted using finite linear combinations** of elements of the everywhere 
dense set fy, far ---: fs -- +s 

Such a method of constructing an orthonormal system is usually 
called orthogonalization process. 

We shall assume that there are no linearly dependent*** elements 
among f;, fo, ---: fn. --.~ (otherwise by successively increasing 1 
we should remove out of {f,} every element Th which is a linear 
combination of f,, fo, ---: fn-1)- 

* That such a set Af is countable follows from the countability of all rational 
numbers and from the countability of the number of all polynomials of different 
ezree. 


“2% An element ¥,, is said to be a linear combination of elements iu fr, seag fm 
if there are real numbers Os ny oo oe Oy, SUCH that FL = ayfy + Sofe >... + 


end ‘This means that none of the elements j,, of {7} is a linear combination of 
a finite number of other elements of {f,}. 
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We construct a system of mutually orthogonal nonzero elements 


Wy, We, 2... Va, -.. such that for any nm each of the elements 
Wo Y.. ..., 4, ig a linear combination of f,, fo. .... fy and, 


conversely, each of the elements j,, fa. .--, fy is a linear combi- 
nation of W,, W%., ..., W,*. 

We prove by mathematical induction that the system of clements 
Wo, Wa, ..., WW. ... can be successively defined using the rela- 
lions 


Wey (11.24) 
(fs, ) (ft, Ts) ose (Fs Ves) fs 
Ys, _ (fo, Ys) (fa, Vo) vee (fos VY n-4) fs for nD 2. (11.22) 


e @ @ij@¢- @e@ @ 8% @  $@ @ @ @ #  ®  @®  @ ®  @®  @ ® 


Fass 1) ins Y’) “ee Ins ‘ne -1) fn 


Clearly, the element , defined " (11.21) is nonzero (for otherwise 


for any n the elements f,, f., -..; f, would turn out to be linearly 
dependent). 

Thus for n = 1 all the above requirements are fulfilled. We now 
suppose that the system ¥,, YV,, -» V,-, constructed using rela- 
tions (41.21) and (11.22) satisfies all the above requirements and shows 
that then the system V,, V., ..., VY, constructed using the same 


relations satisfies them as well. 

From (11.22) it is clear that an element YW, is somo linear combi- 
nation of fi, fe, for ..., f, and is thus nonzero (otherwise that 
linear combination would turn out to be a zero element, i.e. the 
elements f,, fo, ..-, f, would turn out to be linearly dependent). 

Further, since the clements fi faye oy fn-1 con be linearly expressed 
in terms of W,, V., ..., W,_, and since the minor in the right- 
hand bottom corner of the determinant (41.22) of f, is equal to 
{ney (f" and hence nonzcro, it follows from (14.22) that f, can 
be linearly expressed in terms of Vj, W., . Wie 

Finally, from (11.22) it follows immediately that the clement 
YW, is orthozonal to each of the elements Y,;, ., ..., W,-;. Indeed, 
if it is any of the integers 1, 2, ..., 2 — 1, then performing scalar 
multiplication of both sides of (11.22) by 1, we obtain on the right 
side a determinant whose ‘th and nth columns are the same. It 
follows from the equality of such a determinant to zero that (V,, 
WF.) = 9 for al] & = 1, 2, ,n— 4 

This completes the induction and thus the system 'F,, W.,... 
woe Vas. satisfying the above requirements is constructed. 


* In terms of Jinear algebra this means that the span of ¥y, Ye, ooo. FY, 
coincides with the span Of fs, fare sar dye 

*® Ty show this it suffices to write equation (11.22) for (n — 1) and perform 
the scalar multiplication of it by W,;. 


Ch. 11. Hilbert Space 373 


On setting now for every nq, = W,/ || V, || we obtain an ortho- 
normal system ,, Mo, - ++) Qn» -e 

The closure of the constructed system {g,} follows immediately 
from the fact that each element of the everywhere dense set {f,} 
is a linear combination of a finite number of elements of {q,}. 

From the countability of the everywhere dense set of elements 
fi» for » +> fny « -» it follows that the closed orthonormal system 
constructed contains at most a countable number of elements. But 
the number of elements of that system cannot be finite, for this would 
mean that the space L* is finite-dimensional*. 

This completely proves the existence in L? of a closed orthonormal] 
system consisting of a countable number of elements. 

We note in conclusion that a closed orthonormal system of ele- 
ments of L* is often called an orthonormal basis**, 

11.2.4. Isomorphism of the spaces ZL? and 2* and its consequences. 
In L? (£), just as in J*, the concepts of weak convergence of a sequence 
of elements and of weak compactness of a set of elements can be intro- 
duced. 

Definition 1. A sequence {f, (x)} of elements of a space L? (E) 
is said to converge weakly to an element f(x) of that space if for any 
element g(x) of L* (E) 


(fro 8) >(f, 8) as nm—0o 
or equivalently 


| fn (2) 6 (@) dz—> \ f(x)g(x) dz as n+, 
E E 


It can be proved, as elementary as for the case of /*, that convergence 
of {f, (x)} to f (x) in the norm of L* (Z) implies weak convergence 
of {f, (x)} to f (x). Of course, the weak convergence of the elements 
of L? (Z) does not imply their convergence in the norm of L* (£) 
(any orthonormal sequence of elements of L’ (Z) may serve as an 
example). 

Definition 2. An infinite set M of elements of L* (E) is said to be 
weakly compact if we can choose a weakly convergent subsequence of 
any sequence of elements {f, (x)} in M. 

The concept of linear continuous functional can be introduced 
in ZL? in a way quite similar to that for /?. 


* That the dimensionality of L* (£Z) is equal to infinity follows immediately 
from the fact that for any preassigned n there are n linearly independent 
elements 4, z, 27, ..., z™-) in IL? (E). 

«* A system of elements {q,,} is said to be a basis of Z°(E) if for any 
element f of L2(E) there is a unique expansion of that element into a series 
boo] 


s} CnQy, With constant coefficients c, converging to f in the norm of L?(£). 
n=1 
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Definition 3. A functional 1 (f) defined on the clements of f of 
I. (£) is said to be linear if for any two clements f and g of L* (E) 
and for any real numbers a and fi, 1 (af +- Be) = al (f) 4+ Bl (2). 
Let us agree tocall elements f, whenever convenient, points of L* (£). 

Definition 4. iA functional | (f) defined on the elements of L* (E) 
is said to be continuous at a point fy of L°(/) if for any sequence {f,} 
of elements of L* (F) converging in the norm of L* (E) to the element 
fo the number sequenee | (f,) converges to I (fy). 

Definition 5. A functional l(f) is said to be continuous if it 
is continuous at each point f of I? (E). 

Jt is easy {o prove, as for the case of /*, that if a linear functional 
of L* (2) is continuous at least at one point of Z° (£), then it is 
continuous everywhere in ZL? (/), i.e. simply continuous. 

The question naturally arises as to where it is possible to oxtend 
Theorem 11.2 on the general form of a linear continuous functional 
and Theorem 11.3 on weak compactness of any set bounded (in 
the norm) to include the case of L* (£). 

We shall establish a close connection between £7 and 2? which 
wil] allow us to immediatoly prove that both theorems ore valid 
for L*. 

We introduce the following fundamental notion. 

Definition 6. Two arbitrary Euclidean spaces R and R' are 
said to be isomorphic if a one-to-one correspondence can be established 
between the elements of Rand R’ so that, provided the elements x‘ and yy’ 
of R' are the images of elements x and y of R, the following requtre- 
ments are fulfilled: (1) the element x’ --y' of R’ is the image of the 
element x + y of R; (2) for any real 2. the element dx’ of R’ is the 
image of the element 2.x of R; (3) the scalar products (x', y') and (xz, y) 
are equal to each other. 

It is established in linear algebra that all n-dimensional Eucli- 
dean spaces are isomorphic to one another and to the space £". 

The principal aim of Section 11.2.4 is to establish that infinite- 
dimensional Euclidean spaces Z* (/) and [? are isomorphic. But 
we shall first prove the following remarkahle theorem. 

Theorem 11.6 (the Riesz-Fisher theorem). Let {sp,} be an 
arbitrary orthonormal sustem in I2(f)*. Then for any sequence 
of real numbers (€4, Co, ..-; Cyne -) that satisfies the condition 


ao 


N" ch. <0, i.e. isan element of 12. there is a unique funetian 
kot 
P(r) in EP (FE) such that e,-=(f. 4) = | f(z) ¢, (2) dx and Sh st 


r he«{ 
“WAP = | P(e) de. 
IP 
* Neither the completencss nor, clearly, Che closure of the system is acsumed. 
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nh 
Proof. Set fn= © ChGr- The sequence {f,} is fundamental, 


m 
since for m>=n ||fm—fall@= >> c and, as stated, the series 
nant 


> cf converges. But then by the completeness of L?(E) (estab- 
d 
lished in Section 8.4.7) there is an element f of L*(#) such that 


lim {fa ~f I= lim |] & cnn —fll=0. (11.23) 


From the last relation and from the Bessel identity (10.17) es- 
tablished in Section 10.1* it follows that 


nN oo 
lim Sch=WIP, ie. 2 ch= IFIP. 

We prove that (f, @,) = c;, for any k. To do this notice that by 
the orthonormality of {p,}, for all x > k 


nh n 
(fos Pa) = (d) CrP; On) = 2 Cr (Pt, Pr) = Ch (11.24) 
and that by the Cauchy-Buniakowski inequality 


IGns O)—, PA I= WM ia—fs Pr) 
SVP Me = Vf — 7 
and by (14.23) 

(fny Pr) > (f, Pr) as m > &. (11.25) 
From (11.24) and (11.25) we get (f, pn) = cn for any &. 

It remains to prove that f is a single element of L* (Z) satis- 
fying all the conditions of the theorem. Let g be any other ele- 
ment of L* (£) satisfying all the conditions of the theorem. From 
the Cauchy-Buniakowski inequality | (fn, —f, g) |< V iif, —FX 
XV il g || and from (411.23) it follows that 

(fn — fr €) +0 as noo, (14.26) 


But from the equation (g, @,) = c, and from the scalar product 
axioms it follows that 


n nT 
(fn—fr B= CX Cn Pa—F, B)= Ni en (Bs On) — 
ra 
—(f, B= 2 ch (hs 8), 


22s 


* This Bessel identity is true for any orthonormal system in an arbitrary 
Euclidean space. 
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so by (11.26) 


N ch=(f, 8). (11 27) 
From (11.27) and from the relations “' ch=]]f {f= and “ d= 
[= kuwf 


={lelP we get 
lf-ecll=-af-H=ifir—-20, os ile =9. 


But this means that the difference f — g is a zero element of L* (£), 
ic. f = g. This completes the proof. 

Remark. If an orthonormal system {q},, is closed or at least com- 
plete, then the element f is unique even without the requirement 


that ‘cf = || f jI? (see in this connection Theorem 10.8). 
} 


Less f 

On the basis of the Riesz-Fisher theorem we prove the following 
main theorem. 

Theorem 11.7. L* (E) and F are isomorphic. 

Proof. Choose in £7 (Z) a closed orthonormal system {,} and 
associate with cach element f of Z* (2) an element c = (¢,, Ca, ... 
» ee; Cy, »--) Of EF whose coordinates c, are of the form c, = ff, 
g,) (k = 1, 2, ...). By Theorem 11.6 this is a one-to-one cor- 
respondence. 

It remains to prove that if corresponding to the elements f and g 
of £7 (2%) are respectively the elements ¢ = (c,, Ca, . ~~ Cny «+ +) 
ond d = (d;, de, ..., dn, ...) of I, then (4) corresponding to the 
element f-+g¢g is the elemont ec 4- d = (c, + d,, c. +- da, ... 
ee ey Cy + Gn, ..-), (2) for any 4% corresponding to 7f/ is Ae = 
= (2.0), ACny «0 y ACny ~~) (3) 


=_—, 


(f, e)=(e, d)= Py Crd), (14.28) 


(this equation is usually called the generalized Parseval formula). 

(4) and (2) follow immediately from the properties of a scalar prod- 
uct*. We prove (11.28). By the closure of {q@,} the following Par- 
seval’s formulas are true for f, ¢, and f + g: 


fy A= Si chs (8s B= 2 di (14.29) 
(f+e. f +e)= S (cy +d)? (14.30) 
* Thus, to prove (1) it suffices to notice that (f — &e Gp) = (Ae G7) He (Oe Gy) = 


= cy + dp. 
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Subtracting (11.29) from (11.30) we get 


2(f, g)=2 2 Crap. 


The proof of the theorem is complete. 

Theorem 11.7 allows 7? to be considered as coordinate notation for 
the elements of L* (Z). It extends all] the statements established for J? 
to L* (Z), and vice versa. 

In particular, it implies the following. 

1°. I* is complete. 

2°. Any set bounded in the norm of £° (£) and containing an in- 
finite number of elements of Z? (£) is weakly compact. 

3°. For every linear continuous functional J (f) defined on the ele- 
ments f of L* (Z) there is one and only one element g of L* (£) such 
that, for every element f of L* (Z), 1 (f) = G, g) with 


_ AUR eee 
I} 24) SUP. TFA lg I). 


From a quantum-mechanical point of view Theorem 411.7 is a ma- 
thematical justification of the equivalence of Heisenberg’s “matrix 
mechanics” and Schrédinger’s “wave mechanics”, the mathematical 
formalism of the former being the coordinate space J* and of the se- 
cond—the space ZL? of functions with an integrable square. 

Theorem 11.7 naturally suggests that 7 and Z° are both but differ- 
ent specific realizations of the same abstract space which we now 
proceed to consider. 


11.3. ABSTRACT HILBERT SPACE 


41.3.1. Abstract Hilbert space. A Hilbert space H with two partic- 
ular forms of which, 2° and L’*, we have already got acquainted can 
be introduced axiomatically as a collection of elements X, Y,Z,... 
of any nature that satisfy a certain system of axioms. 

We enumerate all the axioms which must be satisfied by elements 
of an abstract Hilbert space 77. 

J, (a) Axiom on the existence of a rule which associates with any 
two elements X and Y of a space Hf an element Z of that space called 
the sum of X and Y. 

(b) Axiom on the existence of a rule which associates with any ele- 
ment X of H and any real number ?. an element of JJ called a product 
of X by 4. 
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(c) Eight axioms of a linear space*, 

Hl. (a) Axiom on the existence of a rule associating with any two 
elements Y and ¥ of H a number called a scalar product of X and 
Y and designated (X, Y). 

(b) Four axioms of a scalar product**. 

JIT. Axiom on the completeness of a space J/ with respect to the 


norm defined by the equation |} VY {J = (XY, X)**?*. 

IV. Axiom on the existence in 77 of any prenassigned number of 
linearly independent clements. 

¥. Axiom on the existence in /T of a countable, everywhere dense 
set (in the sense of the norm of J7) of elements. 

In other words, a Hilbert space JT is any linear Euclidean complete 
infinite-dimensional separable space. 

The following notions are introduced in the Wilbert space Jf: 
(1) convergence of a sequence of elements in the norm and weal: con- 
vergence (a sequence of elements {X,,} is said to converge weakly to 
an clement X if for any Y, (Xn, Y) ~+(X, Y) as n —+ 00); (2) weak 
compactness of a set Af of elements of /7 (defined as the possibility of 
choosing a weakly convergent subsequence of any sequence of cle- 
ments of .1f); (3) a linear and continuous functionals ? (X) defined on 
the elements X of /7 (1 (X) is said to be linear if 1 (aN 4- BY) = 
== al (NX) -+ B2(Y) for any elements X¥ and ¥ of JT and any real 
numbers a and fi; 2 (X) is said to be continuous at a “point” NX, if 
1(X), —1(X,) for any sequence {X,} of elements of HW for which 
HX, — Ny |] — 03 2 (X) is said to be continuous if it is continuous at 
each point X of 77). 

The existence of a closed orthonormal system of elements {®,} 
can he proved for the abstract Hilbert space /7 in a way quite similar 


* These axioms can be found in any course of Hnear algebra. They are 

listed here for convenience. 

1. VY + Ys Y-+- N (for any clements X and Y). 

2°, N -+- (Y 4+- 2) = (N -b Y) + Z (for any elements X, Y and Z), 
3°, There is an clement 0 such that XY -+ 0 = X for any element X. 
4°. For every clement NX there is an element X’ such that X -+- X’ = 0, 
5°. @ (BN) = (af)-X for any element X and any real numbers « and fi. 
6°. 1-N = N for any element X. 
~(a-- fy X= aN -- BN for any elerment XN and any real numbers « 


ber @. 
** Axioms of a scalar product can be found in Section 10.1. They are listed 
here for convenience, 
4°. (XN, Y) = (Y. NX) for any elements X and Y. 
9° (x +- Y, Z) = (KX, 2) + (Y, 2) for any elements NX, Y, and Z, 
3°. (aX. Y) x= @ (X, ¥) for any elements X and Y and for any real number a. 
4. (X, X) > 0 for every nonzero clement XY. (0, 0) = 0. 
#¢° For the definition of thre completeness of a normalized linear space soo 
Section &.4.7. 
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to that used in Section 11.2.3 for L? (to do this the orthogonalization 
process for a countable everywhere dense set of elements of H is 
carried out). 

For the abstract Hilbert space H (just as for L?) the Riesz- 
Fisher theorem is true: if {M,} is an arbitrary orthonormal system 
in H and (cy, Co, «-+) Cny ---) IS an arbitrary sequence of real 


numbers satisfying the condition S ch <( oo, then there is a 


———d 
—_ 


single element X in H such that c,=(X, M,) and > ch = || X |[*. 


The proof of this theorem differs from that of Theorem 14.6 only 
in that throughout the reasoning elements of H should be taken in- 
stead of those of L’. 

The Riesz-Fisher theorem allows the following fundamental theorem 
to be established. 

Theorem 11.8. All Hilbert spaces are isomorphic to one another. 

It suffices to prove that any Hilbert space AH is isomorphic to 2’, 
and to do this it suffices to repeat the proof of Theorem 114.7, replac- 
ing throughout the elements of L? by those of H. 

The following statements immediately follow from Theorem 11.8. 

1°. Any set bounded in the norm of H and containing an infinite 
number of elements of H is weakly compact. 

2°. For every linear continuous functional 1(X) defined on the 
elements X of a Hilbert space H, there is one and only one ele- 
ment Y of H such that for all X of H l(X)=(X, Y), with 


_ 0.9) 


Remark. It can be proved that every weakly compact set AZ of 
an infinite number of elements of H is bounded (in the norm of 72). 
In other words, it can be proved that the boundedness of a subset M 
of H containing an infinite number of elemenis is a necessary and suf- 
ficient condition of weak compactness of MM. 

11.3.2. The equivalence of the completeness and the closure of an 
orthonormal system in Hilbert space. According to Theorem 10.7 
in any Buclidean space (and therefore in any Hilbert space) any 
closed orthonormal system is complete. We now prove that the con- 
verse is also true in Hilbert space. 

Theorem 11.9. Any complete orthonormal system of elements of an 
arbitrary Hilbert space H is closed. 

Proof. Let {D,} be an arbitrary complete orthonormal system of 
elements of H, and let V be any element of H. It suffices to prove 
that the nth partial sum §, of the Fourier series of an element V 
with respect to {D,} converges to that element VY in the norm of H. 
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n cc 
“ e oN a 
Let c,=(U, @,), S,= S e,@,. Since cf converges* and 
ke] Kea 


since (by the scalar product axioms and the orthonormality of 
{D,}) for any mien 


a . a 2 
I Sm—Saltl=l] Seta dd=C Senta, MS c,t,)= Sei, 
h=n+{ keent{ Ant] hen+3 


the sequence {5,} is fundamental. 
But then by virtue of the completeness of H there is an element ', 
of H such that 
1S, — Vo |] ~0 as no. (11.34) 


Jt remains to prove that fy = W. To do this it suffices to establish 
that the clements YU and YW, have the same Fourier coefficients**. 
Fix an arbitrary /. For any 2 Sh, by the orthonormality of {9,} 
and the scalar product axioms 


? n 
(Sn Dg) = (Y ody, ,,) = 2 €,(D1, D,) = cy. (41.32) 
l= { [= 


On the other hand, since on the basis of the Cauchy-Buniakowski 
inequality 

| (S,, D),)— (Vo, D;) [= | (S,—Vo, DES 

S<VTSi— Vol UO M= VS — Voll, 
it follows from (14.34) that 

(Sn, D),) > (V5, @D,) as rn —- oo. 

From this relation and from (11.32) we get (W>, @,) =e, = (VU, 
(p,). Thus the theorem is proved. 

Corollary. In a Hilbert space H the completeness of an orthonormal 
system is equivalent to tts closure. 

Remark. For an incomplete Euclidean space Theorem 411.9 is in 


general not truc. 
The following example will demonstrate this fact***, 

Consider a Euclidean space C° of all functions f (x) continuous 
on a closed interval [—z, a] with a scalar product defined by 


st 


(f; gy= { f (x) 2 (x) dz. 


~ 


neal . ° 
* The convergence of that serics follows, for instance, from the Bessel in- 
equality (see Theorem 10.10). 

** Yndeed, the coincidence of all Fourier cocfficients of ¥ and i’, would mean 
that the element w — I, is orthogonal to all ©, and, therefore, by the com- 
Joteness of the system of @,, zero. 

P'ys* This example is due to Sh.A. Alimov. 
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This space is certainly not complete* (and is not therefore a Hilbert 
space). We construct in that space a complete orthonormal system 
of elements which is not closed. We do this in two steps. 

4°. We first prove that in a Hilbert space L* [—x, x] there is a 
complete orthonormal system Qo, 91, Me; -- +; Qn» --~ Such that 
the function @, (z) is discontinuous on [—x, x] and all the functions 
Q, (Z), nm = 1, 2, ..., are continuous there. Set 


4 
—— h 0< <5 3 
Yo @=4 Vi. “< (11.33) 


O when —xn<zx<JQ, 
Yon (2) = VASE ME (nt, 2, ...), 


Vu 
v2 sil nz 
—————. when —1<7<0, 
Pon—y(Z) = Vx —s (n= 4, 2, coeds 
0 when 0X z2=<=2 


Notice at once that the function WY, (x) is discontinuous on [—a, =] 
and all the other functions W,, (z) (n = 1, 2, ...) are continuous 
there. Besides, it is easy to verify that Y, (x) is orthogonal on [—x, =x 
to each of Y,, (x) (for alln = 1, 2, ...). 

We show that although the system {¥, (z)} (x = 0, 1, 2,...) 
is not a system orthonormal in L? [—x, x] nevertheless it is comlete 
in the sense that any element f(r) of LZ? [—zx, x] orthogonal to 
every WY, (x) (with n= 0, 1, 2, ...) is equivalent to identical 
zero. 

Indeed, let f (z) be any element of L? [—x, x] orthogonal to every 
¥Y, (t) (2 = 0, 1, 2, ...). 

From the orthogonality of f(x) to every element of {¥on_, (x)} 
(n=1, 2,...) it follows that on [—x, O] f(z) is orthogonal to 
a system { se" } (n=1, 2,..-.) and therefore, by the com- 
pleteness of that system on [—<x, O] (established in Remark 1 of 
Section 10.3.2), f(z) is equivalent to zero on [—a, QJ]. 

In such a case the orthogonality of f(x) to every element 
Von (z) (n=0, 1, 2, ...) implies that on [0, x] f(z) is orthogonal 


rs 
to a system r= ; V2 cos nz (n=1, 2,...) and, by the com- 
a oie 


pleteness of that system on [0, 1] (established in the same Remark 1) 
f(z) is equivalent to zero on [0, x] too. 


* It suffices to fix some function f, (x) piecewise continuous (but not strictly 
continuous) on [—zx, xt] and notice that (by Corollary 2 of Section 10.3.3) the 
sequence of partial sums of the Fourier trigonometric series of f{, (z) converges to 
f, (z) in the norm of L*? [—ax, xa]. On the basis of the completeness of Z? [—=x, =] 
that sequence is fundamental. Although each element of the sequence is a function 
continuous on [—x, =] its limit in L* [~zx, x], the function f, (z), is not in C®. 
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Thus f (z) is equivalent to zero on the whole interval [—a, a}. 
So the svstem {V(x} (7=O0, 1. 2, ...) is complete in 
E2,—a, a}. Applying the orthogonalization process to the system 
We. WV. Wo. oe, Va, .., we oblain an arthonormal system YV,, 
WF, 1, weer Vay...» Tt remains to normalize the Jast system, 
usr 
i.e. fo set® qo=s 0%, in = ; (with a= f, 2, ...). 
} rniis 
We obtain a complete orthonormal system {y,} (7 = 0, 1, 2, ...) 
whose zero element qo (7) = U9 (x) is defined by formula (41.33) 
and is a function discontinuous on [—2a, 2] and all the other ele- 
ments, being linear combinations of continuous functions, are con- 
tinuous on {—za, x}. 
2°. We now return to the space C° of all functions continuous on 
[—x, a] and prove that the system qy, qa, ..+. Gnas.» + iS com- 
plete in C® but is not closed in it. 
We first show that {¢,} (n =: 1. 2, ...) is complete in C°. Let 
VY be an arbitrary element of C° orthogonal to every g,, with #7 = 
= 4.,2,..., i-c. such that 


(Y, pa) = 0, with n =i, 2,... (11.34) 
Then the function 


f=V— qo (TV, Go) (11.35) 
is an element of L* [—z, x} and satisfies the condition** 
(Gf, Pn) =O for all n = 0, 1, 2,... (41.36) 


By comleteness of {,} (rn = 0, 1, 2, ...) in Lt? f—a, al it fol- 
lows from (11.56) that f is a zero element, but then from (41.35) 
and Jrom the fact that VY (2) is continuous and fy (z) is discontinnons 
on [—a, a] it follows that (VW, m5) = 0. The last equation in con- 
junction with (11.34) implies that Y is a zero clement, i.e. establishes 
the comp]eteness of {p,} (rn = 1, 2, ...) in C% 

We now prove that {q,} (n = 1, 2, ...) is not closed in €° 

nN 


Let P be a polynomial of the form P == “'ayq, with absolutely arbi- 
KA 


trary coefficients a, (k = 4, 2, ..., n). By the orthonormality. of 
fp,} (2 = 0, 1, ...) and by the scalar product axioms 


nae _ i/ 2 2 ~ 
oo—Pil=V(%—P, To—~ P)=V thepo lF + PUPS t. (11.37) 


* We observe that (('Y,, {| == f. 
** Indeed, with n = 4.2, ..., (11.36) follows immediately from (41.34) and 
fram the arthagonality of ¢, to every q,, (n= 4, 2... .). The equation (f, p,S--0 


follows from (11.35), from the scalar product axioms and from the fact that 
(Go To) == 1, 
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Since a set of continuous functions is everywhere complete in L? (—x, 
|, for @, there is a continuous function f (xz) such that 


ll Go —fil< 4/2. (41.38) 


But from (14.37) and (41.38) it follows that || f — P |] > 1/2 for an 
absolutely arbitrary polynomial (with any coefficients), and this 
means that an element f of C° cannot be approximated in the norm 
of L?[—n, m) by a linear combination of elements of {p,} (n = 
=14,2,...), ie. that the system {p,} (n = 1, 2, ...) is not 
closed in C°, 


114.4. COMPLETELY CONTINUOUS SELF-ADJOINT OPERATORS 
IN HILBERT SPACE 


11.4.4. The linear continuous operator. Let H be an arhitrary 
Hilbert space. For convenience we shall label the elements of H 
with small Latin letters z, y, 2, .... 

If there is a rule associating with each element z of H some ele- 
ment y of H, then we say that an operator A from H into 7 is defined 
and write y = Az. 

Definition 1. An operator A is said to be linear if for any elements x 
and y of H and for any real numbers « and {3 


A (az + By) = a-Az + B-Ay. 


As for the case of the functional, we shall (whenever convenient) 
call the elements of H points of H. 

Definition 2. An arbitrary operator A from H into H is said to be 
continuous at a point z, of H if for any sequence {x,} of elements of 
H converging in the norm of H to xq the corresponding sequence {Ax,} 
converges in the norm of IH to the element Azo. 

Definition 3. An operator A issaidto be continuous if it is continuous 
at each point x of —H. 

Definition 4. An arbitrary operator A from H into H is said to be 
bounded if there is a constant C such that for each element x of H || Az ||X 
<C |< ll. 

Kofaitions 4 to 4 are closely similar to the corresponding defini- 
tions for the functional in Section 11.4.2. 

This allows us to give without proof the following statement: 
a linear operator A from H into H is continuous if and only if itTis 
bounded. 

The proof of the statement is absolutely identical with that for 
Theorem 11.1. 

For a linear continuous operator A (just as for a linear continuous 
functional) we can introduce the concept of norm. 

Definition 5. The norm of a linear continuous operator A is the 
supremum of the relation || Ax || /{| x || on the set of all elements x =~ 0 
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of H (or, equivalently, the supremum of || Az {{ on the set of all ele 
ments x of H whose norm jj x {{ is equal to unity). 

The norm of a linear continuous operator A will be designated 
| A |i, So by definition 


| A |{== sup || Az I]. (11.39) 
ay 


Henceforth throughout this section linoar continuous operators 
are considered. 

Kere is an example of a linear continuous operator in Hilbert space. 

Consider a Hilbort space L* [a < ¢ < b} and suppose that we are 
given some function of two variables, A (¢, s), defined and conti- 
nuous in the square fa <t<b] X la <s<b]. We prove that 
au integral operator A given on the elements z (t) of 2? la<t <b] 
by 


b 
Az (1) =| K (1, s)x(s) ds (41.40) 


is linear and continuous. The linearity of the operator follows im- 
mediately from the linear property of the integral. 

To prove the continuity of the operator (11.40) it suffices to esta- 
blish its boundedness, for which it is sufficient to establish the fi- 
niteness of its norm (11.39). We denote by AY a number 


w= [x20 s) at as |”? (14.41) 


and show that {JA {|< Jf. By the Cauchy-Buniakowski inequality 
and the definition of the norm 


b b b 
{| Ax (1)? |< | K2 (1, s)ds | 22(s)ds=fpri? | AB (t, s) ds. 
, "1 ‘ 


On integrating the last inequality with respect to ¢ between a and b 
and using the notation (11.41) we have 


az |< AT [tz Il. 


But this implies the boundedness of A and the validity of the in- 
equality ]A ||]<¢.4/ for its norm. Note that for some integral ope- 
rators (41.40) |I A |] is exactly equal to AT. 

14.4.2. The adjoint operator. We now introduce an important 
concept of adjoint operator. 

Suppose that in a Hilbert space /7 an arbitrary linear continuous 
operator A from JT into /7 is given. 
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Fix an arbitrary element y of H and consider a functional f (z) = 
= fy (z) = (Az, y) (defined on all elements z of H). It is obvious 
that it is linear and continuous. By the Riesz theorem on the general 
form of a linear functional there is a unique element h = hy of H 
such that f (x) = (x, h) for every element x of H. 

So we have associated with each element y of H one and only one 
element of H such that f, (z) = (z, h), i.e. we have defined in 
some operator A* such that h = A*y. It is this operator A* that 
is called the operator adjoint to an operator A. 

In other words, we arrive at the following definition. 

Definition J. An operator A* is said to be adjoint to an operator A 
from H into H if for any elements x and y of H 


(Az, y) = (2, A*y) (14.42) 


It follows from the above that for every linear continuous opera- 
tor A there is a unique adjoint operator A”. 

It is immediate from Definition 1 that if for an operator A* there 
is an adjoint operator (A*)*, then (A*)* = A. 

We now show that for the case where A is linear and continuous, 
A* is also linear and continuous (and for A* therefore there is an 
adjoint operator and (A*)* = A, the last equation allowing the 
operators A and A* to be called mutually adjoint operators). 

Theorem 11.10. An operator A* adjoint to a linear continuous ope- 
rator A is also linear and continuous, with the norms of A* and A con- 
nected by the relation 


A* =A th (41.43) 


Proof. The linearity of A* follows immediately from relation (11.42) 
and from the scalar product axioms. It remains to prove the bounded- 


ness of A* and equation (11.43). 
By (11.42), the relation |] Ay {I< [lA Il lly I]* and the Cauchy- 
Buniakowski inequality, for amy elements z and y of H 


((Atz, y) [= ‘(, AY I<ile il Ay <A Ill ielly ll 
Inserting in this inequality A*z for y we get for any x of H 
| A*x [PF = (Ata, A¥z) < IA Il- ll 2 Il] A*z |] 


or | A*z <A [Hz Mb | 
The last inequality implies that A* is bounded and that its norm 
\| A* |] satisfies the condition 


WA* [<A Il. (114.44) 


* This ‘relation, true for any element y of #, follows from the definition of 
the norm of aJjlinear continuous operator A. 
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The linearity and boundedness (or equivalently the cantinuity) of 
A” ensures the existence of an adjoint operator (A*)* =-1. Repeat- 
ing for that operator the above reasoning we get instead of (14.44) 


A =< WA* I. (11.45) 


Equationg (11.44) and (14.45) vield (11.43). Thus the theorem is 
proved. 

Definition 2. An arbitrary operator A from I into Hf is said to be 
self-adjoint tf for A there is an adjoint operator A* coinciding with A 
(i.e. if (Az, y) = (7, Ay) for any elements z and y of /7/). 

As an illustration, consider again an integral operator (14.40) 
with some function A’ (¢, s) continuous on the square fa < ¢t < b] 
<fa<s< bj (this function is generally called the kernel of the 
integral operator (11.40)). 

We show that the adjoint of the operator A given by equation 
(11.40) is the integral operator A* given by 

b 
Aer) = | K (s, t)xz(s) ds (11.46) 


a 


(by K (s, t) in (11.46) one should mean the same function as in 
(11.40) but integrated with respect to the first independent variable). 
It follows from ares and (14.46) that for any elements x (2) 


and y (t) of the space L* la, b] 
b h 
(Ar, y) -: \ ( | Ket, s}z{s) ds | y(t) dt (14.47) 
: t) I 
(x, aA*y) = | ( VK (ts) (1) dt) x(s) ds. (11.48) 
non 


The right-hand sides of (11.47) and (11.48) differ only in the order 
of integration with respect to the variables ¢@ and s and therefore 
coincide®. Then, so do the left-hand sides and this precisely means 
that the operator A* defined by (11.46) is adjoint to the operator A 
defined by (11.40). 

It follows from (11.40) and (11.46) that the integral operator given 
by (11.40) is self-adjoint ifand only if A(t, s) = K (s, 0) for all t 
and sin fa, b}. The kernel A (¢, 8) satisfying the equation A’ (f, s) = 
= K (s, t) is called symmetric. 

We now prove the following statement. 


* Indeed, for continuous functions z (f) and » (t) the equality of the right- 
hand sides of (11.47) and (14.48) is obvious. But then, by Thenrer 14.4 and the 
Cauchy-Buniokowski Inequality, this is true for arbitrary elements x (1) and 
u(t) of L2{a, bl as well. 
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Theorem 11.12. The norm \\A‘\\ of a linear continuous self-adjoint 
yperator A is the supremum of | (Az, x) | on the set of all elements z 
of H, with the norm equal to unity, i.e. the norm of A is given by 


{| A ff sup | (Az, x) |. (11.49) 
Meet 


Proof. Denote by t the quantity on the right side of (11.49) (the 
existence of the supremum is obvious). To prove that 1 = || A |, 
it suffices to establish the two inequalities p < |] A |{[and p > |[A |f. 

The first of these follows immediately from the fact that on the 
basis of the definition of the norm of an operator and the Cauchy- 
Buniakowski inequality for every element z for which || z || = 


(Az, x) | < || Ax |f- [fx |] = [Az |< |A If. 
It remains to prove the inequality uu => || A ||. Since the operator 
is hinear, for each element z of H* 
| (da, 2) |< pelle Iie (14.50) 


Further, it follows from the scalar product axioms and from thie self- 
adjointness of the linear operator A (i.e. from (Az, y) = (z, Ay)) 
that for any elements z and y of H 


4 (Az, y) = (A (@+y), e+ y) —(A (&— y), E— ¥). 
From this equation and from (41.50) we get 
4|(4z,y)|<apfoty(Ptpelle—y iP = 2p (ei + 
+ ily WP). 
From this last inequality it follows that for arbitrary elements x 
and y of #f for which l{ z |] = lly || = . 
\(Az, y) <p. (14.54) 


On setling in (11.54) y = Az/|| Ax || we get, for every element x 
for which || x || = 4, (Az, Az)/|| Az |] < ju and therefore || Az || < 
<u. Hence jf A {[ <p. Thus the theorem is proved. 

11.4.3. The completely continuous operator. 

Definition. An operator A from H into H is said to he completely 
continuous if it maps each set, bounded (in the norm), of elements of H 
into a compact set. 

In other words, A is said to be completely continuous if for any 
sequence {z,} of elements of # such that || z, || << C = const there 
is a subsequence {z,,} (ck = 1, 2, ...) such that the corresponding 
subsequence {Az,,} converges in the norm of H. 


Recall that a linear operator A is continuous if and only if it is 


* Since for every clement zp)= 
| (Azq, %) | <p. 
25* 


-c with the norm equal to unity 


4 
I] = || 
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bounded, i.e. ifand only if it maps any set bounded (in the norm of H) 
again into a bounded set. Since a compact set is bounded*, any com- 
pletely continuous operator is continuous. It should be added that 
not any continuous linear operator is completely continuous. For 
instance, an identity operator / of the form Ex = z is continuous, 
but it is not completely continuous: it suffices to consider the mapping 
of a bounded noncompact set to see this. 

We prove the following lemma. 

Lemma. Let A te a linear completelu continuous operator jrom H 
into H. Also let {x,} be an arbitrary sequence of elements of If weal:ly 
converging to an element xo and such that |} x, || = 1 for all n. Then 
the sequence {Az,,} converges to the elements Az, in the norm of H. 

Proof. Since A is linear and completely continuous**, according 
to Section 11.4.2, there is an adjoint operator 4* and for every cle- 
ment z, and an arbitrary element uv (Az,, y) = (z,, A*u). From 
this and the weak convergence of {z,} to zy we get H (Az,, vy) —> 
—(z,. A*y) = (Az, v) for any element y of H as 2 — oo, and this 
means that {:f27,} weakly converges to AZo. 

We now prove that {fz,} converges to Az, also in the norm of ff. 

Suppose {Az,} does not converge to Az, in the norm of H. Then 
there is ¢ > 0 such that for some subsequence of elements {z,,,} 


(kK = 4, 2, ...) 
|} Azn, — Az, flee. (11.51') 


Since 4A is completely continuous and {| z, {| = 1 we can choose a 
subsequence {Zn ,} (p = 1.2,...) of {7,,} such that the corre- 
sponding subsequence {Az, ,} converges in the norm of //. Since by 
what was proved above {Az, *}converzes weakly to s1z5, it also con- 
verges to Az, in the norm of #7. But this is contrary to inequality 
(11.51") true for every m, (and clearly for every r,). 

This contradiction proves the lemma. 

Remark. The above lemma is a consequence of a more general state- 
ment: an operator A from H into H is completely continuous if ané 
onlu if it maps anu wealsly convergent sequence {x,} of elements of H 
into a sequence {Az,} converging in the norm of H. 

We shall leave this statement unproved. 

We now show that an integral operator A given by (11.40) (with 
kerne] A (t. s) continuous in the square la <t< b]) x fas < BI) 
is a completely continuous operator. 

Let fr. (t)} be an arbitrary sequence of elements of ZL? fa. UI 
bounded in the norm of L* fa. bJ, i.e. such that for every n 


a, uc. (11.52) 


* See Section 11.1.3, 
*e And therefore contintas. 
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It suffices to prove that the corresponding sequence of functions 
y, (t) = Az, (t) is uniformly bounded and equicontinuous on [a, 5). 
(Then by Theorem 41.12 (Arzela) we can choose a subsequence of 
that sequence converging uniformly on fa, b] and, the more so, in 
the norm of LZ? [a, b}). From (41.52) and the Cauchy-Buniakowski 
inequality we get 


[Yn (é) (=| K (t, $) Zn (8) asle<f | K* (i, s) ds |" IIa, | 


which proves the uniform boundedness of {y, (¢)} on la, b]*. 

Notice further that it follows from the continuity and, hence, the 
uniform continuity of the kernel K (z, s) on the square la <t <b] X 
x a < s <b] that given an arbitrary « >0O we can find 6 > 0 
such that 


_ Kt, ee 11.53 
IK (ts, 8)—K (ty 8) | <p (14.53) 


for every s in [a, 0} and every ¢, and ¢, in {a, b] such that [4 — 
— to | <= 8, 

From (44.52) and (44.53) and from the Cauchy-Buniakowski in- 
equality we get 


b 
| Yn (t2)—Yn (ts) I<) | K (ty 8) —K (ta, 8) [+1 tn (8) | ds< 


b oO 
& & _ 
Soe | btn 0) Lasalle “V4 ds=s 


a 


for every t, and ¢, in la, b] such that | t, — 7%, |<. 

The last inequality proves the equicontinuity of {y, (t)} on la, 0] 
and, by virtue of the foregoing, completes the proof that the operator 
(14.40) is completely continuous. 

11.4.4. Existence of eigenvalues of a completely continuous self- 
adjoint operator. 

Definition. A real number i is said to be an eigenvalue of an ope- 
rator A if there is a nonzero element x of H satisfying the condition 
Ax = he, 

That element x is a proper element of A corresponding to the eigen- 
value Kr. 

If A is linear, then it follows from the condition that z is a proper 
element of A corresponding to an eigenvalue 4 that whatever the 
nonzero real number a the element az is also a proper element of A 


* It suffices, to notice that the kernel K (t, s) is continuous on the square 
faxtixb}) X{aexs< dj. 
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corresponding to the eigenvalue #%. Ft is natural therefore to consider 
all proper elements of a linear operator A to be normalized, i.c. to 
satisfy the condition |f x {| == 1. 

The importance of the notion of proper elements lies in the fact 
that the action of an operator on them reduces to multiplication by 
some constant 7. 

Not every operator A has ecigenvalues*. 

We prove the following main theorem. 

Theorem 11.12, Every linear self-adjoint completely continuous ope- 
rator A from [f into [fT has at least one eigenvalue i satisfying the con- 
dition [#4 { = {A ||. Of al? the eigenvalues of A it is the largest in 
absolute value. 

Proof. Denote by Af and m respectively the supremum and infimum 
of the scalar product (Az, z) on the set of all elements x of JT satis- 
fying || z || = 1, i.e. set 


Ms sup (Az, x), m= inf YAz, z). (114.54) 
Waited tt 


For definiteness we consider the case [Af {> [am [ (the case [ AT [<< 
< |m | can be considered quite similar)y). 

Since | A. | >> |] mj, 17 > 0. We prove that the number 2 = AV 
is an civenvalue of A. 

By the definition of the supremum there is a sequence {z,} of ele- 
ments of [WT such that (Az,, z,) +f and [[z, || = 1. Since {z,} 
is hounded (in the norm of /7), hy the theorem on the weak compact- 


* Far instance, the integral operator (41.40) has no cigenyalue when 
fe) 


a=:0, bea, A (7, s)= » 2-" sin (n+ 4) zsinns. Indeed, let @(z) be an arbi- 
n—-{ 


trary element of £250, a) for which A (xz, 8)-Q (s) ds = (z) and Jet {by} 


~ Ono aI 


be Fourier coefficients in the expansion of m(z) with respect to a complete 


19 «3 
system {a} ofthonormal on [0, a]. If 2=0. then from the pene 
T 


fo & 
ralized Parseval formula ‘ 2-"h, sin (n-+41)27=0, whence all b,=-0 and 
n= f 


oe 


G(z)~ 0. But if? 40, then from the equation | X (z, s) p(s) de = 49 (2) 


1D Conergy 3] 


and from the properties of the kernel A (7, s) ensuring uniform convergence 
ro tw 

of the Fourier serics of @(z) we eet No 2-9), sin (n fi) zn Moby sinner. 
n=~{ nin { 

Since Ax 4, i¢ follows from the last equation that every b,-=0 and 


7 (x)= (}, 
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ness of any infinite set bounded (in the norm of H) there is a sub- 
sequence of {z,} weakly converging to some element zp of H. We re- 
number that subsequence, i.e. denote it again by {z,}. So {z,} con- 
verges weakly to z, of H. But then (by the lemma of Section 11.4.3) 
{Az,} converges to Az, in the norm of #. 

Since A is self-adjoint, (Az,, Xo) = (fn, Ax), from which it 
follows 


(Atn, Lp) — (Ax, Lp) = (A (Zp — Xp), (Ln + Zo)). (44.55) 
Applying the Cauchy-Buniakowski inequality (414.55) yields 
|{Az,, tn) — (Az, Zo) |< In + 2p Hl |] dz, — Ax |] +0 


(for {Az,} converges to Az, in the norm of # and || z, {| = 1). 
We have thus proved that 


(AZ_, 2) ~~ (AZ, Zp). (14.56) 
From this and the relation (Az,, 2,) —A it follows that 

(Axy, %) = d. (44.57) 
We now show that || z) || = 4. By the Cauchy-Buniakowski in- 
equality { (za, y) |< Ilan Il-Ily Il = ILy tl for any element y. Pro- 


ceeding in the last inequality to the limit as n —»oo and consid- 
ering the weak convergence of {z,} to z) we get | (xz, y) |<} y ll 
{for any element y). From this last inequality we get |} z» |}< 1 
when y = Xo. To prove that || Zp || = 1, it suffices to show that the 
assumption that 0 < |] z» || <1 holds leads to a contradiction. 

Let 0 < |[ x || <1. Set yo = Lo/|] Zo |]. Then || yg || = 1 and by 
the linearity of the operator and relation (411.57) 


KN 
(Avos Ya) = eqs (Ato» 0) = gape> 


and this (considering that ) = M) contradicts (41.54). So |] zo {| = 4. 
We now prove that z, is a proper element corresponding to an eigen- 
value 2. 
Using the definition of the norm of an element, the scalar product 
axioms, equation (11.57), and the definition of the norm of an ope- 
rator yields 


|| Ax —~— Aatg ||? = (Atg—AX, AT) —AX_) = 
= |] Ay |? — 2% (Azo, 2p) +4? [I xo [?= I] AIPA? 


By virtue of Theorem 14.41 the right-hand (and therefore the left- 
hand) side of the last relation is zero. But this means that Ary = Ax», 
i.e. that z, is a proper element of the operator A corresponding to an 
eigenvalue 4X. 

In the case | Mf |< | m | the reasoning is similar, but A should 
be set equal to m. 
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The only thing that remains to be proved is (hat if there are other 
eigenvalues, then the eigenvalue 7 satisfying the condition {4 { == 
== {{ i |[ is the Jarzest of them in absolute value. Let 2, be some 
other eicenvalue and let z, be the corresponding normed proper cle- 
ment. Then Az, = 4,2, and therefore (Az,, 7,) = 4,. But then it 
follows immediately from the relation* 


|A]}s= sup | (Az, z)] 
{}xfj==4 
xedi 


that [2%] D> [4, | 
Thus the proof of the theorem is complete. 
Using the theorem just proved we consider the so-called Fredholm 
equation of the second kind, i.e. the relation 
h 
x(t) -=yt\ A(t, s)z(s)ds (11.58) 


c 


from which given a kerne} A (¢, s) a function f (t) different from iden- 
tical zero is defined as well as those values of the parameter qc for 
Which such a function exists. Those values of tt for which there are 
solutions z (¢), not identically zero, of the integral equation (11.58) 
are called eigenvalues of that equation. Every nonzero solution of 
(11.58) corresponding to a given eigenvalue is an eigenfunction of 
that equation. 

The inverse of an eigenvalue of the integral equation (11.48) 
is generally called a characteristic number of the equation. 

Obviously, if we introduce into consideration an integral operator 
A given by equation (11.40), then the eigenvalues of A are character- 
istic numbers of the integral equation (11.58) and the corresponding 
proper elements of A are the eigenfunctions of (14.58). 

Jt has beon proved in Sections 11.4.4 to 11.4.3 that if the kerne] 
A (t, s) is continuous in the square la<t<b) x laxs<b) 
and symmetric, then the operator (11.40) is linear, self-adjoint and 
completely continuous. 

By Theorem 11.42 the intogral equation (11.58) with such a ker- 
nel A (t, s) has at least one characteristic number. For equation 
(14.58) to have at least one eigenvalue it should he required that it 
should have at least one nonzero characteristic number for which 
the condition that A (t,s) does not become an identical zero** should 


* This relation follows from (41.54) and from the fact that 2 = Af when 
} Af) > fomand A=: m when | WI < | mf. 

** The condition that the continuous kernel A (1, 5) does not become an 
identical zero is a necessary and sufficient condition for an integral operator A 
defined by (§1.40) to have nonzero cigenvalucs. Indeed, by Thearem 11.12 
YA I} == {fA f], where 2 is the largest absolute eigenvalue of A, eo that it euffices 
to prove that {| A ff == Oifand only if A (t, s) isnot identically zero. If K (t, #) ++ 
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be added to the requirements that K (t, s) should be continuous and 
symmetric. 

So we arrive at the following fundamental statement: if the kernel 
K (t, s) of the Fredholm equation of the second kind (11.58) is conti- 
nuous in the square fa <ct<_b| x la<xts< dl, symmetric, and is 
not identically zero, then the equation has at least one eigenvalue. 

Remark. We could prove that the above statement is also true- 
if the requirement that X (i, s) should be continuous on fa <t< 
<b] X la<s< b] is replaced by a weaker one, that there should. 
be a finite integral 

bb} 

} | x (t, s) Ut ds. 

aa 


(It suffices to show that when that weaker requirement holds the- 
integral operator (14.40) from ZL? [a, b] into ZL? {a, b] continues to- 
be completely continuous.) 

41.4.5. The basic properties of the eigenvalues and proper elements. 
of a linear completely continuous self-adjoint operator. In conclusion 
we discuss the basic properties of the eigenvalues and proper ele- 
ments of an arbitrary linear completely continuous self-adjoint 
operator from AH into HH. 

4°. Proper elements x, and x. corresponding to two different eigen- 
values h, and i. are orthogonal. 

Indeed, on the basis of the properties of a scalar product, the equa- 
tions Az, = 4%, AX, = dq®, and the selfi-adjointness property’ 
of the operator A we get 


(hy — ho) (x, La) = (Aq21; Zo) —_ (2, NoLe) = 
= (Az, Za) —_ (xy, AZs) = Q. 


Since A, = Aq, from the equation obtained it follows that (a, x.) = 0. 

2°. There may be several proper elements of A corresponding to- 
the same eigenvalue 4. We prove, however, that any nonzero eigen- 
value 2X may have corresponding to it only a finite number of linearly: 
independent proper elements*. 

Suppose that there are an infinite number of linearly independent 
proper elements corresponding to some A =« 0. On carrying out the- 
orthogonalization and normalization process for those elements we- 


= 0, then clearly j/|A || = 0. But if, conversely, {| A || = 0, then the operator A 
given hy (14.40) maps all nonzero elements of L*{a, db] into the zero element 
and in particular it maps into the identical zero all the elements {z, (t)} of some: 
complete orthonormal system in Z*{a, 5]. But this means that XK (¢, s) = 0. 

* The zero eigenvalue A = 0 may have corresponding to it an infinite number 
of proper elements, too. For instance, for the integral operator (14.40) with 
K (t, s) identically zero each element of some orthonormal} system, {z, (t)} of 
elements of L* [a, b) isa proper element corresponding to the eigenvalue 2 = 0. 
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obtain an infinite orthonormal system of clements {7,) in Wf, each 

a proper element of -t corresponding to an eigenvalue #& 5% Q. Since 
ay 

a e ~) sy a 

for any element y of H the Bessel inequality “. (ra, uy)? < av dF 


naif 


is true, lim (z,, vy) = 0 = (0, y), ie. the sequence of proper cle- 
Tho 

ments {z,) weakly canverges to the zero clement Q. But then it fol- 
lows from the condition that the operator A is completely continuous 
and from the lemma of Section {1.4.3 that the corresponding se- 
quence {Az,} converges in the norm of 7 to the clement Al 0 = 0. 
By virtue of the relation siz, = Ar, we eet JA | = ff fz, || ~U 
(as n — oo), which means that [2 | = 0 and contradicts the candi- 
tion A254 0. This contradiction proves that every 2 52 0 may have 
corresponding to it only a finite number of proper clements. 

The above reasoning also shows that all proper elements (hath 
corresponding to the same eigenvalue % and to different 2) may he 
considered ta be mutually orthogonal (and to have narms equal to unity). 

Oo. We now prove that if an operator A has an infinite number of 
eigenvalues, then any subsequence {t.,} of eigenvalues ts infinitesimal, 

Let {a,} be any sequence of eigenvalues, and let {x,} be the eor- 
responding sequence of proper elements which (by virtue of the rea- 
soning followed in proving Property 2°) may be considered to be 
orthonormal. Writing for any element y of £7 the Bessel inequality 
with respect to the system {7,} we see thal {z,} weakly converges to 
the zero element. Since the operator A is completely continuons, 
it follows from the lemma of Section 11.4.3 that the sequence (A2,)} 
‘converges to the zero element in the norm of 77. But then the equa- 
tion Av, = A,2, has as a consequence the relation 


[an | = |] dz, |] +0 (as n - oo). 


Property 3° allows us to state that the eigenvalues of a linear com- 
pletely continuous self-adjoint operator have no other limit points on 
the number axis except for the sero point*. 

This means that all eigenvalues may be numbered in the order of 
their non-increasing absolute values, so that 


[ay bee Lag be tag bee eee lan les 


with | 2, | ~0 as n — oo, 

In particular, all the properties we have established are true for 
the eigenvalues and characteristic numbers of the Fredholm equa- 
tion of the second kind ({£.58) with a kernel A (¢, s) that is conti- 
nuous on the square la <t<b) x las < bi and symmetric. 


* For any ¢ > Ofthere{mayjbe only a finite number of eigenvalues outside the 
interval [(—e, &). 


CHAPTER 12 


FUNDAMENTALS OF THE THEORY 
OF CURVES AND SURFACES 


In this chapter we present some facts about curves and surfaces 
of importance for applications. 


12.1. VECTOR FUNCTIONS 


12.1.1. The vector function®. We introduce the concept of vector 
function of m variables. 

If each point M of a set {M} of points in an m-dimensional Eucli- 
dean space E™ is assigned according to a certain law some vector r**, 
then a vector function r =r (MM) is said to be given on {IM}, the set 
{Af} being the domain of r = r (M). If p = m, then, as in the case 
of m = 2 or m = 8 (see Section 6.2.1), a vector field defined by the 
vector function r (4) is said to be given on {Jd}. 

The vector r (J4) corresponding to a given point Jf of {/%} will 
be called a particular (or special) value of the vector function at the 
point M. The collection of all particular values of r (M) is called 
the set of values of the function. 

If {17} is a set of points on a given straight line and {w} is the set 
of their coordinates, then r (47) may obviously be considered as a 
vector function of a single scalar variable uw: 


r=r (u). 
If, however, {1%} is a set of points in an m-dimensional space and 


(U1, Wy, ..., Uy) are the coordinates of a point A, then r (MM) is a 
vector function of the scalar variables u,, U2, ..., Um? 


r=P (uy, Uo, «++; Um). 


Remark. Let {7,, 72, . . -, “p} be the coordinates of r (Mf). Obvious- 
ly, giving a vector function r (J) is equivalent to giving p scalar 
functions r, (MM), r. (MM), . . ., rp (J). 

Let vectors r (J) be in an Euclidean space £”. We shall assume 
that the initial points of those vectors coincide with the origin of the 
Cartesian system chosen in £”. In this case the point set of the end 
points of r(M) is called the locus of r (AM). The locus of a vector 

* Some facts about vector functions were given in Section 5.1.6 of [4]. 


** The jvector r is in general in a p-dimensional Euclidean space E? and is 
therefore defined by the coordinates r,, re, ..., Tp: 
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function of a single scalar variable is in general a curve. The locus 
of » vector function of two variables is in general a surface. 

12.1.2. The limiting value of a vector function. Continuity. In 
close analogy with ordinary functions the concepts of limiting value 
and continuity can be introduced for vector functions. 

Asa preliminary we introduce the concepts of a convergent sequence 
and of limit of a sequence of vectors. 

A sequence {a,)\ is said to converge to a vector a if given any © > 0 
we can find N such that for n> N* 

la, ~a|<e. 

Phe vector a is the limit of {a}. 
In symbols 
lim a, =a. 
ty CO 

Remark. If {@yn, @any + + +1 @pn} and {@), dg... .. Ap} are Tespec- 
tively the coordinates of a, and a, then the convergence of {a,} to a 
implies the convergence of the number sequences {an}, {Gon}, -- 
. ++, {@p_) to the numbers a,, dz, -. +, Ap, respectively. Also note 
that the convergence of the number sequence to ay, aq, dp, respectively, 
implies the convergence of a sequence {a,} of vectors with coordinates 

Qiny Gens + + +) App} to a vector a with coordinates {a), dy, .. «+ Gp}. 
The remark follows from the following obvious inequalities” *: 


lpn — On | <la,—a| < Jay, — 0, | + | Aen — a, | + see 
in | apn — Gp |. 


Consider a vector function » = 2 (A/) defined on a set {:/} of 
points of an m-dimensional Euclidean space and a point A, possibly 
not in {A/} but having the property that in any neighbourhood of the 
point there is at Jeast one point of {.1/} different from A. 

Definition 1. A vector b is said to be the limiting value of a vector 
function r (M) at a point A (or the limit of r (.1/) as Af + A) Tf for 
any sequence M,, Mo, ..., My, ... of points of {.\/} converging to A 
whose elements M,, are different from A*** (\/,5¢A) the corresponding 
sequence Fr (.4/y), 7 Ge), oo (2/,), .- . of the values of r (aT) con- 


verges to 
Jn symbols 


lim r(Af)=b or Jim r(uy, Us, ..., Um) = 9, 
Moa Uy; 


where @y) Ars + + +: Gq are the coordinates of Al. 


¢ The absolute value ja | of a vector a with coordinates {@y, ag, .. «. ay} iF 
a number Vay ah eo ah 
e* The vector a, — a hascourdinates {ay — Gye day mo Uae soy Opn mm Oy} 
ees This requirement is accounted for, in particular, by the fact that a function 
> (M2) may or may not be defined at A. 
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We shall not give any definition of the limiting value of a vector 
function in terms of “se — 6’, neither shall we give any for the case 
where 7 tends to infinity. They can be formulated in full analogy 
with the corresponding definitions for scalar functions. 

Let a point A be in the domain of a vector function r = r (Al) 
and let any neighbourhood of it contain points of the domain differ- 
ent from A. 

Definition 2. A vector function r = r (AQ) is said to be continuous 
at a point A if the limiting value of the function at A exists and is 
equal to a particular value r (A). 

A vector function r = r (A) is said to be continuous on a set {/7} 
if itis continuous at each point of that set. 

42.4.3. The derivative of a vector function. In Section 5.4 of [1] 
we discussed the derivative of a vector function of a single scalar 
variable. For convenience we shall formulate that notion once again. 


Fig. 12.4 


Let r =r (u) be a vector function of a scalar variable u. Fix a 
value wu of the variable and assign to the variable w an arbitrary 
increment Au =< 0 such that w + Au is in the domain of the function. 
Consider a vector 


Ar =r (u-+ Au) — r (u). 


In Fig. 12.4 it coincides with the vector MP. On multiplying Ar by 
a number 4/Au we obtain a new vector 


fo = A Ir (ut Au)—r (w)] (12.1) 
collinear with the old one. The vector (12.1) represents the average 
rate of change of the vector function on the interval [u, u + Av]. 

The derivative of a vector function r = r (wu) at a given point u is 
the limit of the difference quotient (12.1) as Au —>0 (provided the limit 
exists). 

ar 


Jt is designated 7’ (u) or ar 


Geometric considerations* make it clear that the derivative of a 
vector function r = 7 (uw) is a vector tangent to the locus of that 


* They are supported by the statement in Section 12.2.2. 
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function. We show the relation of the derivative of r = r qu) to the 
derivatives of its coordinates. For simplicity, consider the case where 
the values 7 (wu) of a vector function are vectors of a three-dimensiona} 
space. Let {x (u), y (u), o (u)} be the coordinates of the vector function 
r(u)., Obviously, the coordinates of the difference quotient (12.1) are 
equal to 


x(u-- Aw)—sz (uw) ur (t~-- Au) —~ te (u) 2(t-» Auj—s (uj 
Au ? Au y Au ° 


According to the remark of Section 12.1.2 the coordinates of r’ (u) 
are equal to the derivatives 2° (wu), y’ (uv), s° (u) of the coordinates of 
r(u). Computing the derivative of the vector function therefore 
reduces to computing the derivatives 
of its coordinates. 

Remark 1. The vector function » (1) 
can be regarded as the law of motion 
of a particle along the locus L of the 
function, if the variable uv is considered 
as time. The derivative r’ (uv) is there- 
fore equal to the rate of motion of the 
particle along the curve ZL. 

Remark 2. Note jthat the rules of 
differentiation of the various products 
of vector functions (the sealar, thefvec- 
tor, and the triple one) are identical 
with those for differentiation of the products of ordinary functions. 
This follows from the fact that the coordinates of the derivative of 
a vector function are equa) to the derivatives of the coordinates of 
the function itself and from expressing those products in terms of 
the coordinates of the factors. 

Iere are the rules of differentiation of vector function products: 


{r (wv) s (ay =r! (ua) s (eu) -ber (u) s (1). 
{Ir (2) s (ue)}}o = Er’ (ut) s (ee) + Ur (a) 8’ ()], 


{fr (u) s (uw) t(u)} =r’ (u) s (u) t(u) + or (Wu) 8? (u) O(a) + 
-or(u) s (u) U (uv). 


We now proceed to discuss differentiation of vector functions of 
several scajar variables. Since in what follows we shall use vector 
functions of lwo scalar yariabJes uv and v, we shal] confine ourselves 
to this case. 

Let a vector function r = r (u,v) be given in some neighbourhood 
G of a point Alp (ta, Up) (Fig. 12.2). Consider in the plane (u. v) some 
direction defined by the unit vector @ with coordinates cos @, sin a. 
Draw through J./, an axis 7 whose direction coincides with that of a, 


Fig. 12.2 
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take on 2 points Af (u, v) and denote by J the magnitude of a directed’ 
segment J, of l. The coordinates (u, v) of a point MW are given by 


u=U,+loosa, v=vp + lsina. 


On the axis 2 the function r = r (u, v) is obviously a vector function 
of asingle variable l. If that function has at the point | = 0 a derivative 
with respect to the variable 1, then that derivative is said to be the deri- 
vative of r=r(u, v) at the point My (Up, Vp) with respect to the 


direction of 2 and designated a ; 


Remark 3. If the direction of @ coincides with that of the 
coordinate axis u(axis v) (in Fig. 12.2 these directions are indi- 


Fig. 12.3 


cated by the broken lines), then the corresponding directional 
derivative is said to be the partial derivative of r(u, v) and 


designated aa Or ly (> or ry } if a is defined al every point 


Ov 
of some neighbourhood of Jf (u, v), then it is a vector function 
in that neighbourhood. It may in its turn have a partial deri- 
vative, with respect to the variable u, for instance. It is natural 


to call that partial derivative the second partial derivative with 
respect to wu and designate it ae (or ry). Other partial deriva- 
tives of various orders can be similarly defined. 

The geometrical meaning of the directional derivative will be 
clear from the following reasoning. The locus of r=rvyu, v) is 
in general a surface § (Fig. 12.3). When a point Jf (u, v) moves 
along 7, the end point P of r(u, v) describes on S a curve ZL 


Which may be regarded as the locus of a vector function of 

a single variable /. The derivative s with respect to the direc- 

tion of 2 is therefore the vector tangent to LZ at the point Py. 
If the direction of 2 coincides with that of the coordinate 


axis u, then as a point Mf moves along the corresponding 
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axis passing jthrough Wf, the end point of the vector r(u, 2) 
describes on S a curve cabled a coordinate curve uw(it is desigenat- 


ed by the broken line in Fig. 12.3). Thus — is the vector tan- 


° r ° . e a 
gent to the coordinate curve u. The partial derivative — is the 


vector tangent to the coordinate curve v. 

$2.1.4. Differentiability of a vector function. Zhe inerement for 
total increment) of a vector function r = r (u,v) ata point MW {u, v) 
{corresponding to the increments Au and Av of the independent 
variables) is the following expression: 


Ar =r(u-+ Au, v + Av) —r(u, v). 


A vector function r = r (u, v) is said to be differentiable at a point 
M (u,v) if its total increment at that point may be represented as 


Ar =a Aut b Av+a Au + f Ap, (12.2) 


where a and & are some vectors independent of Au and Av and « and 
-are infinitesimal vector functions as Au ~~ O and Av ->-0* equal to 
zero when Au = Av = 0**, 
Remark 1. [fr=r(u, v) is 
differentiable at a point af (u, v), 
then obviously aand b are equal to 


or or . : 
= and =~ at the given point, re 


spectively. 

Remark 2. Let v=r(u, v) bea 
vector function differentiable at a 
point Af (u, v) and Jet 2 be some 
Fig. 12.4 axis passing through 37 in the plane 
(u, v)and making an angle @ with 


the axis u. Then the derivative x with respect to the direction 


of I exists and can be found from the formula 


or aye or, ‘ 
—- = — cos @-+ —— sina. (412.3) 
ol Ou ou 


Indeed, for the direction of 2 we have Auz=lcosa, Av-~Isin2 
(Fig. 12.4). Substituting these values of Au and Av in relation 


(12.2) and using the relation <= lim Sr we see that formula 
10 
(12.3) is valid. 


* A vector function « (Au, Av) is said to be infinitesimal if ite Hinit as 
Auw-~( and Ar— 0 equals zero (the zero vector}. 

** We give no definition of differentiability of a vector function of a single 
senlar variable. [It can be formulated in close analogy with the corresponding 
definition for scalar functions of a single variable. 
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Remark 3. We have seen that in the case where the function 
r=r(u, v) is differentiable formula (12.3) is true. It follows 


from this formula that all vectors 2~ are in the plane of the 


ol 
vectors -— and oe It is natural to call the plane passing 
through the point of the locus of r(u, v) corresponding to a 


point Af (uv, v), and parallel to ae and ae the tangential plane 
to a surface § which is a locus. In Fig. 12.3 the plane x is 
the tangential plane to the surface S at Pp. 

12.1.5. The Taylor formula for vector functions. The Taylor ex- 
pansion formula for a function r = r (u, v) with centre at a point 
M (u, v) and with remainder in the Peano form is as follows: 


r(u--Au, v-+Av)=r(u, v)$+2E ay 4 Er ay + 

+p (SE pue4 220) ava + 

+g 08) bebe (Gea Au 

+n are) Auta + wee +S Ave | +R, (Au, Av), (42.4) 


where the remainder &, (Au, Av) is a vector whose order of small- 
ness is higher than p” (p = V/V Au® + Av*)*. 

The validity of formula (12.4) can be shown by representing each 
of the coordinates of the vector r (uw, v) by the Taylor formula with 
remainder in the Peano form and then writing the expression for 
r(u -+ Au, v + Av) with the aid of an expansion with respect to the 
basis vectors (the coefficients of the expansion will be the coordinates 
of that vector). 

12.1.6. Integrals of vector functions. We have already noted that 
a vector function is defined by its coordinates which are scalar 
functions. This allows the operation of integration to be carried over 
to the case of vector functions. 

Let a vector function r(w) be given, for instance, on a closed 
interval [a, b) and let its coordinates r, (w), rp (u), r3 (u) be functions 
integrable on [a, b}. If e,, e., eg are basis vectors, then it is natural 
to set by definition 


b b b b 
\ r(w) du=e, \ r,(u)dute, \ ro(u) du-+e, { r;(u) du. 


* The order of smallness of a vector is defined to be the order of smallness of 
its absolute value. 


26—01684 
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rules of integration of scalar functions can be extended to include the 
case of integrals of vector functions. 


12.2. SOME FACTS FROM THE THEORY OF CURVES 


12.2.1. Regular curves. In Section 11.1 of [1] we discusseq the 
notion of curve and the methods of its representation. One of the 
methods indicated there was the parametric method consisting jn 
representing the coordinates of a variable point of a curve as a func. . 
tion of a scalar variable, the parameter. Assuming those coordinates 
to be the coordinates of the vector leading from the origin to the 
point of the curve we obtain a vector function whose locus is the 
given curve. Thus we may represent a curve by a vector function of 
a single scalar variable and this method is equivalent to giving a 
curve In parametric representation. 

Let a curve L be given by a vector func- 
lion » = r (t)*. Assume that using the rela- 
lion ¢ = f (wu), where f (uw) is a Strictly in- 
creasing and continuous function, the para- 
meter ¢ is replaced by another parameter, 
u. The function r = r(t) converts into a 
new function r = r (f (u)) of the parameter 
u. Thus we can obtain different parametri- 
zations of the same curve. 

We shall say that a curve L is regular 
(k times differentiable) without Singular — 
Fig. 12.5 points if it allows a parametrization using 

the parameter ¢ such that the vector func. 
tion r = r (t) for some integer k = 1 is k times differentiable and 
r’ (t) 40 for all values of t. With k = 1, the curve is said to be 
smooth. 

In this chapter we shall consider regular curves without singular 
points and those parametrizations of such curves for which r’ (t) + 0. 

12.2.2. The tangent line to a curve. Let L be a curve and let P 
be atfixed point on it (Fig. 12.5). Draw a chord PM. The straight 
line PQ to which the chord PM tends** as M—- P is called the tangent 
(tine) to L at the point P, 

The followingYstatement is true. 


e . i ‘ 
. tor function r = r (ft) is usually called the radius vector of the curve & 
” We shall say that the frctehe line PM tends to the straight line PQ 4s 
M — P if the angle between them tends to zero. 
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A smooth curve L without singular points has the tangent at every 
point P. 

We prove that the tangent is the straight line PQ passing through 
the point P and parallel to the vector r’ (z) (recall that r’ (t) 5€ 0). 
Indeed, the vector ois parallel to the chord PJM (see Fig. 12.5) 
and tends to r’ (t) as At 0. It follows that the angle between the 
straight line PAY and the straight line PQ tends to zero as M + P. 
Therefore the straight line PQ is the tan- 
gent to the curve L. Thus the statement is 
proved. m 

We derive a vector equation of the tan- 
gont to acurve L atapoint P. Let R be the 
radius vector of a variable point @ on the 


tangent at P. The vector PQ ='R—r (t) 
is collinear with r’ (t) and therefore 
R—r (t) = ur' (t). From this we obtain 
the desired equation 


R=r(t) + ur’ (2) (12.5) 


where uw is the parameter and zis the fixed 
parameter value on £ defining P. 

12.2.3. The osculating plane of a curve. Let PQ be the tangent 
{o the curve L at P (Fig. 12.6). Through PQ and the point M of L 
draw a plane POM. The plane x to which PQM tends* as M > P 
is called the osculating plane to the curve L at P. 

The following statement is true. 

A regular (at least twice differentiable) curve L without singular 
points has the osculating plane at every point at which r’ (t) and r” (t) 
are not collinear. 

We prove that an osculating plane is the plane x passing through 
the tangent PQ and parallel to r” (t). Obviously the vector 


Fig. 12.6 


n = (r’ (t) rr’ (t)J (12.6) 
is the normal vector to the plane x and the vector 
m=z fr" (t) Ar), Ar=r(t-+At)—r (2) (12.7) 


(see Fig. 12.6) is the normal vector to the plane PQM. Since L is 
twice differentiable, by the Taylor formula 


Ar=r’ (t)At-++r" () AZ 4+a-AR, (42.8) 


eee 


* We shall say that the plane PQM tends to the plane m as M ~+ Pjifithe 
angle between them tends to ‘zero. 


26% 
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where @ is an infinitesimal vector function as Af +0. From formulas 
(12.6) to (12.8) it follows that 


m= {r’ (t)r” (1+ 2 (r’ (jal =n +f, (12.9) 


where Bp = 2 [r’ (t) a) is an infinitesimal vector function as At — 0, 
It follows from relation (12.9) that the vector mz tends to nas Mf —-> P 
and consequently so does tho angle @ between the planes PQA and a. 
Therefore the plane x is the osculating plane to the curve at P. 
Thus the statement is proved. 

We derive a vector equation of an osculating plane. Let 2 he 
the radius vector of a variable point S of that plane. The vectors 


PS = R—r(t), r’ (t), and r” (t) are parallel to the osculating 
plane and therefore R —r (t) = ur‘ (ft) + vr’ (t). From this wo 
obtain the desired equation of an osculating plane 


R=r(t) + ur’ (t) + vr" (0, (12.10) 


where uw and v are the independent variables of the function 22 and 
is the fixed parameter value on Z corresponding to the point P. 

We obtain the equation of an osculating plane in another form. 
since R —r (t), r’ (f), r” (f) aro coplanar, RF satisfies the following 
equation: 


(R —r (f))r’ (t)r" (1) = 0. (12.44) 


If X, Y, Z are the coordinates of R (the coordinates of the variable 
point S of the plane z) and z (é), y (t), = (t) are the coordinates of 
r (t), then in coordinate form equation (12.11) becomes 

X—2z(t) Y—y(t) Z—< (t) 

z(t) y(t) a" (t) t= 0. (12.12) 
z(t) y(t) a" (2) 

Equation (12.12) is obviously the equation of an osculating plane. 

Remark. We have defined the osculating plane geomotrically by 
means of proceeding to the limit, and therefore if it exists it is 
unique. From this and from the statoment proved hera it follows that 
if at a givon point a of a curve there is an osculating plane, then for 
any parametrization of the curve r” (¢) is parallel to that plane. If 
the parameter f is considered as time, then r” (t) is the acceleration 
vector as the point moves along the curve Z according to the law 
r (t). Thus with any method of motion along a curve, the acceleration 
vector at a given point is in the osculating plane of the curve at that 
point. Therefore the osculating plane is also called the plane of acce- 
leration. 

A straight line passing through a point P of Zand perpendicular to 
the tangent at that point iscalled @ normal. A normal in the osculat- 
ing plane is called the principal normal of the curve and a normal 
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perpendicular to the osculating plane is the binormal to the curve. 
Derivation of equations of these straight lines will be left to the 
reader. 

12.2.4. The curvature of a curve. Let P be a point of a regular 
curve Z without singular points and let M7 bea 
point of ZL different from P. Denote by 9 the 
angle between the tangents at P and 7 and by 
1 the length of an arc PM* (Fig. 12.7). 

The curvature k, of L at P is the limit of the 
ratio g/2 as 1 +O (i.e. as MZ — P). 

The following statement is true. 

A regular (twice differentiable) curve L without 
singular points has a well-defined curvature k, at 
each point. 

We proceed to prove the statement. Let P and 
M correspond to parameter values ¢ and ¢ + At, Fig. 42.7 
respectively. 

Evaluate sin m and J. Since Z is regular, r’ (¢) = 0 at any point of 
£ and therefore 


we 


—G 
NX 
x 


oy Er’ er’ (E+ Ad)} | 
P= THO ADT? (12-9) 
t+At 
l= \ [r’ (t) | dv== |r? (t*) | At= |r’ (¢) | At+ 6Az (12.44) 


t 


where 6 -+0 as At —0. 

Note that in the transformations of the expression for 1 we used 
the mean value formula for the integral and the continuity of the 
function r’ (t). 

We transform the expression (42.143) for sing. By the Taylor 
formula 


r’ (t+ At)=r’ (t) +r’ (t) At+aAt, a-0 as At —0. 
Using this formula makes the expression (12.43) for sin m take the 
following form: 
ir’ ()r” ®))1+8 

Tr’ (®) 12-hy At, (12.15) 


where 6B +0 and y —0O as At —>0. 
Turning to formulas (12.14) and (412.15) and using, with » + 0, 
the identity 


sin @= 


* Since Z is regular, any arc PM of L is rectifiable, 
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(with q=0, the ratio + is equal to zero ) we get 
PF ir’ Gr" (OU B (12.46) 


1 “~ sing Ir’) peep 


where ff and p tend to zero as At +0. Since p 0 as At 0, 
—— ~-41as At +0. Therefore it follows from (12.46) that as 


Ai +0, ic. as AY + P, the limit — exists and equals 


i 
Suny caay ane Thus the statement is proved. 
So under the hypotheses of the statement the curvature /, oxists 


and can be found from the formula 


Ur ron 5 
hy (12.17) 


Remark. If the arc Jongth 7 is chosen on a curvo as a parameter, so 
that r =r (l), then [r’ (1) | = 1 and r’ (/) is orthogonal to r’ (/)*. 
In this case obviously formula (12.17) takes the following form: 


ky = |r" (d }. (12.18) 


12.2.5. The torsion of a curve. Let P bea point of a rogular curve 1 
without singular points and let AY be a point of LZ different from P. 
Denote by g the angle betweon the osculating points at P and Af 
and by J the length of an arc P.M. 

The absolute torsion | k,. | of the curve LZ at P is the limit of the 
ratio g/i as 1 —+Q (i.e. as AT > P). 

The following statement is true. 

A regular (three times differentiable) curve L without singular points 
has a well-defined absolute torsion at every point where the curvature is 
different from zero. 

We proceed to prove the statement. 

Let P and Af correspond to parameter values t and ¢ + At respec- 
ticoly. The normals to the osculating planes at P and Jf are defined 
hy the vectors [r'r"] p and [r’r"],,**. By the Taylor formula, consid- 


* If the are length is a parameter, then by virtue of the arbitrariness of ? 
4 
and Slit follows from the formula Al = Pr’ (t) | dr that Jr’ (1) J = 4 at 


! 
any point of the curve. Differentiating r’* (2) = 1 we get 2r’ (I) r* (2) = 0, ie. 
rn 1’ is orthogonal to r’ (J). ' 


is computed at points P and Af respectively. 
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ering that [7’r"] = 0, we get 
(rr) xe = [r'r"] p 4- (ir’r"}) p At+aAt= 
<= [xr] p -+ [r’r Jp At + @ Al, (42.19) 


where a —-Q as At —0O. 

To compute the limit of q// as 7 --0 we shall need the value of the 
sine of the angle :~ between the normals to the osculating planes at P 
and Jf. To this end we find the absolute valuo of the vector product 
of [r’r”} p and [r’r"],, and the product of the absolute values of these 
vectors. Using (12.19) yields 


[277 p [2'r" yp) = [[r'r" |p ([r'r"] p+ [r’r”] p At + aAt)]. 


From this, employing the distributive property of a vector product 
and the well-known formula [a (be]] = b (ac) — c¢ (ab) for a double 
vector product, we find 


[[r°r’} p ['r" ar] = rp (rrr) p At +- BAL, 
where § = [[r’r") p a], and therefore B ~0 as At 0. From the last 
expression for [[7’r7J p (7'’7°},,) we obtain the following formula: 
Ulr'r"]p [rr Jad = [rel [(r'r"r" )p] Att yal, (12.20) 


where y —»~-0 as At -+0. 
Similarly we obtain also the formula 


tre Pp] leo bcd = [rr Tbe Bt, (12.24) 


where u —>(Q as Al —0. 

Using formulas (12.20) and (12.21) we obtain the required oxpres- 
sion for sin @p: 

eo ah irr") 4-y) At 

RPS Pre pat 
Note that in this expression the values for the derivatives of the 
vector function r (t) are computed at P. 

Turning to the expression (12.14) for 1, using the just obtained 


furmula for sing and the well-known limit sine —- 1 as 
p — 0 we see that the limit + as 1-»(Q exists and equals 


Lr”) | 

[r'z"}> ° 

So under the hypotheses of the statement the absolute torsion 
[it, | exists and can be found from the formula 


lip} = er (12,22) 


re 
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We define the torsion i, of a curve using the equation 
(r’r’r7) 

~— ft 4 

k,= 4 rrp: (12,23) 
We prove that /. is independent of the choice of parametrization of a 
curve and is therefore a well-defined geometrical characteristic of a given 
curve™, ; 

We proceed to another parametrization of a curve, with the aid of 
the parameter Tt. 

Denoting differentiation with respect to 7 by a dot we obtain the 
following formulas by the indirect differentiation rule: 


r'=rt’ 


r"=rt'*-+- {terms linearly expressible in terms of r}, 


r°= rt3+tf{terms linearly expressible in terms of r and r}. 
From these formulas follow the relations 


rr") =(r ror), [r'r'}2=[r ryt’. 


Thus 
no (irre) (rr or) 
arr ap 


We have shown that k, is independent of the choice of parametriza- 
tion of a curve. 

{2.2.6. Frenet’s formulas, Natural equations of a curve. In Sec- 
tion 12.2.3 we have introduced the concepts of normal and binormal 
of a curve. These together with the tangent are the edges of a tri- 
hedral angle called a@ natural trikedron. Let the arc length be a para- 
meter 2 onacurve L. Thenr’ (/) = tis the tangent unit vector to L. 
Choose the unit vector nm of the principal normal] to be collinear with 
r” (1)** and take as the binormal unit vector 


b = [tn]. (12.24) 


Thus ¢, n, & form a right-handed triple of vectors, i.e. (tab) > 0. 
They are functions of the arc length. We find the expansions of the 
derivatives 7’, mn’, b’ of these functions with respect to t, n, and OU. 
Since t =r’ (2, we have t! =r’ (1). Therefore the vector t' is 
collinear with 7: 


t’ = an. 


* The absolute value | ky] is defined geometrically. Hence parametriza- 
. . . . (r’r’r™ 
tion may influence only the sign of the expression Tye 
** According to the remark of Section 12.2.4 the vector r” (1) is orthogonal tot 
and is in the osculating plane of the curve. 
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According to the remark of Section 12.2.4 a=hk, (a= [vt [= 
= |r" (l) | = &,) and therefore 
t= lyn. (12.25) 


We now turn to the vector &. Since it is a unit vector, 6’ is orthogonal 
to 6. We prove that 8’ is orthogonal to é as well. Differentiating the: 


bh’ 


Osculating 
do splane 
ah 
Rotation of binormat, t > (Y) 
with Z Increasing : 


rs 


Rotatton of principat 
normal, with [ increasing 


Fige 12.8 


identity (bt) = 0 we get (6’t) + (bt’) = 0. Since by (12. 25) (bt’) = 
= k, (bn) = 0, we have (6’t) = O and this means that 6’ is orthogo- 
nal to t. It follows from the above reasoning that b’ is collinear 
with n, i.e. 

b' = Br. (12.26). 


We prove that B= —,. Let om be the angle between the oscu- 
lating planes of a curve at the points corresponding to parame- 
ter values 7? and /-+-Al. Clearly the angle between O6(l) and 
b{l-+ Al) is also equal to g, since 8 is orthogonal to the oscu- 


lating plane. Considering therefore that jim ar =k, we get 


Ai>0 


at 


Consequently, since | B | = | 0’ _ we shave the relation | 8 | = jk, |. 
Let 6’ and m have the same direction. It follows from formula (12.26) 
that in this case B = |b’ Jj, ic. B > O. It is clear that in this case 
r’ (1), r’ (1), and r’” (J) form a triple opposite in sense to the iriple 
t, n, 6 (Fig. 12.8) and therefore (r’r'r’”) <Q, ie. ky <0. Since 
B > 0 and |B | = | k, |, we have f = —kp. In the case where 0b’ 

and m are opposite in direction, a similar reasoning readily shows 
that Bf <0 and &, > O. Since |B | = [ A, |, in this case, too B = —kg. 
When £8 = 0, the equation B = —k, is obvious. So we have proved 
that 


= —he, (12.27). 
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From formulas (42.26) and (12.27) follows the required expression 
for 0’ 


b! == — ton. (42.28) 
We now find the expression for n’. Using the rule of vector product 
difierentiation and formulas (12.25) and (12.25) we get 

n’ = [Bz)’ = [b'7Z] +. [or'] = —k. [nt] -L hy {on} —_ —hyt 7 feab, 


Tabulating formulas (12.25), (12.28) and the expression for n’ just 
obtained we eet the following formulas called Frenet's formulas* 


t' = Kin, 
n= —heyt +b, (12.29) 
b’ = — fron. 


They are also called the basic formulas of the theory of curves. 

It follows from Froenot’s formulas that if the curvatures /, and hk, 
of acurve Z are known, it is possible to find the derivatives of the 
vector functions 7, n, and 8 (i.e. the rates of chango of these functions). 
This naturally suggests that curvature and torsion define a curve ZL, 
which is really the case. Namely, the following statement is true. 

Let k, (l) and kz (1) be any differentiable functions, with k, (ly > 9. 
Then there is a curve unique up to a position in space for which I, (0) 
and }t, (l) are curvature and torsion respectively. 

We shall not prove this statement. Note merely that its proof 
relies on the theorem of the existence and uniqueness of the solution 
of a system of ordinary differential equations. 

Since according to the above statement the curvature i, (1) and 
torsion }:, (1) completely define a curve, the system of equations 


ky =k (Q, My = he (Y) 


is usually called the natural (inirinsic) equations of a curve. 


12.3. SOME FACTS FROM THE THEORY OF SURFACES 


In Chapter 5 we have learned a number of important facts about 
surfaces: we were introduced to the concept of surface, to the con- 
cept of regular and smooth surface without singular points, to the 
concepts of tangential plane and of normal to a surface. Here wo 
shall discuss a number of other important properties of regular sur- 
faces. 

12.3.1. The first quadratic form of a surface. Mensurations on 
a surface. Let @ be a regular surface without singular points and 
Jet r (u, v) be a radius vector of that surface. Then, as is known, 
(rrp)  O. 


* J.F. Frenet (1816-1900) is a French mathematician 
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The first quadratic form I of a surface @ is the expression 
I = dr’. (42.30) 


The term “quadratic form” is associated with the fact that the expres- 
sion 


[=dr'=(r, du+r, dv)? = ri, du?+2r,r, du dv+rjdv* 


isjthe quadratic form of the differentials du and dv. 

The first quadratic form is a positive definite form: it goes to 0 only 
if du = dv = O and is definite for the other values of du and dv, 
Indeed, if dr? = 0, then dr = r, du +r, dv = 0. Therefore, if du 
and dv do not vanish together, then from r, du +r, dv = 0 it 
follows that r, and r, are collinear, i.e. [r,,] = 0, and this is 
impossible since under the hypothesis [r,r,] ~ 0. 

For the coefficients of the first quadratic form the following notation 
is used: 


ri=E, ryr,=F, ri=G. (12.31) 


In this notation the expression (12.30) for the first quadratic form 
becomes 


J] = dr? = FE du? + 2F dudv + G dv’. (42.32) 


So on a regular surface D given by a radius vector r =r (u, v) the 
first quadratic form I is defined by relation (12.32). The coefficients of I 
can be calculated from formula (12.31). 

Using the first quadratic form it is possible to make measurements on 
a surface: to calculate arc lengths of curves and angles between curves 
and to measure areas of domains. 

Let L be a regular curve on @ defined by the parametric equations* 


u=u(t) v=v{(t), fatch, (12.33) 


u (t) and v (t) being differentiable functions with continuous deriva- 
tives. 


* It is clear that representation of u and v as functions (12.33) of some para- 
meter ¢ defines on the surface a curve given by the vector function r (u (t), v (¢)). 
The question whether any smooth curve ZL on ® can be given by parametric 
equations of the form (12.33) can get an affirmative answer, for imstance, as 
follows. Let =x (t), y (t), z (t) be parametric equations of Z. Then wu and v as func- 
tions of ¢ can be determined from the equations z (#) = z (u, v), y (¢) = y (u, v), 
z (i) = 2 (u, v). A solution of the form (12.33) is guaranteed by the condition 


{ru?ol 0 from which it follows, for example, that 7u 70 1£0. The last 


condition ensures the solvability of the system xz (t) = x (i, vy), y (t) = y (u, v) 
for u and v. 
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It is known that the length JZ of the arc of Z defined. by the radius 
vector r = r (u (t), v (t)) can be found from the formula 


t 
[= ( pe’ (2) | dt (12.34) 
to 
seo formula (44.24) in [4]). 
Since |r’ (t) | dt = |r’ (u (t), v (¢)) |dt = [ dr (u, v) | from for- 
mula (12.34) we get 


1 f ir'tar= § \dr (u, vyl= | Var= | VI (12.35) 
to L L L 


(the Jast three integrals in (12.35) aro line integrals of the first kind). 
So if we know the first quadratic form we can use (12.35) to calculate 
lengths. 

We now proceed to measurements of angles on a surface. 

Let a surface @ be given by a vector function r = r (u, v). 

A direction du : dv at a point P on @ is defined as the direction of 
the vector dr =r, du-+-r, dv at that point*. 

Consider at P two directions, du: dv and 6u:6v. The angle 
between them is determined from the analytic-geometry formula 
for the cosine of the angle m between vectors dr = r,, du +r, dv 
and ér =r, du +r, dv: 

(dr-6r) 

V dr? ¥/ br? | 
From this formula we obtain the following expression for cos » 
if we take into account reJations (12.34): 


E dubu--F (dubv-- dvgu)4-G dv §v 
V E dut-+2F du du-+-Gdv? VY béu2+- 20 dues +- Gov 


cos p= 


cos 9 = 


The angle between curves L, and L. on @ intersecting in P can be 
defined as the angle between the directions of the tangents to ZL, 
and Z. at P. Note that if a curve on a surface is defined by the para- 
metric equations u =u (f), v = u(t), then the direction du : du 
at a point of that curve is defined by the vector 


dr=ry,du--r, dv=(rgu’ + rp’) dt. 


So if we know the first quadratic formjwe can use (12.36) to cal- 
culate the angles between directions on a surface. 

The question of measuring the areas on a surface was considered 
in detail in Chapter 5. 

Recall that if a domain JJ on a surface is defined by parameters u 


and v in their domain Q, then the area g of I can he calculated from 


* Obviously, this vector is in the tangential plane ‘fatty. 
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the-formula 
o= \ \ V EG— F? du dv 
Q 


(see formula (5.18)). 

Thus, if we know the first quadratic form we can measure the areas 
of domains on a surface. 

All the facts that can be obtained by measurements on a surface 
using the first quadratic form relate to the so-called intrinsic geometry 
of surfaces. 

Two different surfaces may have the same intrinsic geometry. The 
simplest example of such surfaces is the plane and parabolic cylin- 
der, Note that surfaces having the same intrinsic geometry are called 
isometric. 

12.3.2. The second quadratic form of a surface. Let @ be a regular 
surface defined by a radius vector r =r (u, v) and let x (u, v) be 
the normal unit vector to that surface given by 


_ (rury] {ruts * 
~ Wfrerll ~~ YEG Fe - (12.87) 


The second quadratic form II of a surface is the expression 


I] = —dr dn. (12.38) 
Since dr-n = 0**, d (dr-n) =0, i.e. d@r-n = —drdn, and therefore 
the second quadratic form can also be given by 

I= dren, (42.39) 


Since @r = r,,,, du? + 2r,, dudv +r,, dv’, according to (12.39) 
the second quadratic form may be written as 

IT = (ry) du® + 2 (ry,n) du dv + (royr) dv. (42.40) 

For the coefficients of the second form the following notation is 
used: 

Tyyn=L, ryyrn= M, r,n=N. (12.41) 
Applying to the expression (12.37) for m and using (12.41) we obtain 
the following formulas for the coefficients of IT: 

L= Puululp M= Tuo ulo ; — —7vvlulo 9.4 

V EG—F? ’ / V EG—F? N V EG—F? (12.42) 


12.3.3. Classification of points of a regular surface. We discuss 
deviation of a surface from the tangential plane at a given point. 


* Since | [ryrp] [= VY r2r2—(rurs)*, according to formulas (42.34) 


\{rury] {= V EG— F?. , 
** The vector dr is in the tangential plane to the surface and therefore dr-n =0. 
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Let () be a regular (twice differentiable) surface, Jot r = r (vu, v) 
be a radius vector defining it, let # (uz, v) be the normal unit vector, 
let P (u,v) be a fixed point of the surface, let n,, be the vector nr (u, v) 
at P*, and let AY be a point of the surface corresponding to parameter 
values u + Au, v + Av (Fig. 42.9). 

Let WV be the foot of the perpendicular dropped from Af to tho 
tangential plane x at P and let & be a quantity whose absolute value 


Fig. 12.9 


is equal to the distance from JJ to a, hk having a positive sign if 


the directions of the vectors NV.A7 and np coincide and a negativo 
sign otherwise. Obviously 


h = Arenp, (12,43) 


where Ar =r (u + Au, v + Av) — r(u, v) = PAL. Since u and v 
are independent variables, we may assume Au = du, Av = dv, and 
therefore using the Taylor formula (see formula (12.4)) we get 


Ar=(dr)p+ + (Pr)p+ Ry. (12.44) 


In this relation the differentials are calculated at P, and Ry isa vector 
of an order of o (p*), where p = |“ du® 4- du*. From formulas (12.43) 
and (12.44) we obtain for h the following expression: 


h= + d'rptp+Rynp (12.45) 


Since d*rp+np is the second quadratic form IJ p calculated at P 
and 22otp = o (p*), relation (42.45) may be rewritten as 


h=+-Ip-+o(p%). (12.46) 
Turning to formula (12.46) it is possible to conjecture that hh is 
mainly influenced by the first term, > IJ p, and therefore the spatial 


* In what follows a letter P at the bottom of a vector will indicate that it Is 
taken at a point P. 
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structure of the surface near a regular point is defined by the second 
quadratic form at that point. 

The following arguments support this conjecture. 

1°. ip is of fixed sign (LN — M* > 0). 

In this case* 


[lp] a Ap’, A> 0. 


From this and from relation (12.46) it follows that k maintains sign 
for every sufficiently small value of p and therefore in a neighbour- 
hood of a point P the surface is on one side 
of the tangential plane mp at that point 
(Fig. 12.40). 

Such a point on a surface is called elliptical. 

The sphere, ellipsoid, elliptical paraboloid 
are examples of surfaces each point of which 
is elliptical. 

2°. Ip is alternating (LN — M? < 0). In 
this case we can find two different directions, 
du: dv and 6u : dv, at a point P ona surface 
such that for the values of the differentials of the variables u and v 
corresponding to those directions IJ, vanishes while all the other 
directions are divided by those two into two classes. For the differen- 


Fig. 12.10 


Fig. 42.44 Fig. 12.12 


tials du and du the ratio du : du of which defines the direction of one 
of the two classes the second form II pis positive, and for the ratios 
du : dv defining the directions of the other class it is negative. There- 
fore the surface near a point P is on either side of the tangential plane 
tp at that point (Fig. 12.41). 

Such a point on a surface is called hyperbolic. 

Each point of a one-shected hyperboloid and of a hyperbolic para- 
boloid is hyperbolic. 

3°. Il p is quasi of fixed sign (LN — M* = 0). In this case we can 
find a direction du : dv at a point P on a surface such that for the 


* It is possible, for instance, to show the validity of the inequality | ITp | > 
> Ap? as follows. We have | Iip] =| 0D du? + 21 du dvu+ N de? | = 
={icos?a-+ 2if cosasina + Nsin°a{p*, where cosa = dn/p, sina = 
= dr/p. Since Ip is of fixed sign, the expression | L cos* @ +- 244 cos @ sin a-- 
+N sin? a | has a positive minimum A, i.e. | Wp | > Ap*. 
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values of the differentials du and du defining that direction the second 
form vanishes. For all the other values of the differentials I] p maine 
tains sign*® (Fig. 12.12). 

Such a point P on a surface is called parabolic. 

Gach point of a cylinder is parabolic. 

4°. II; is equal to zero at a point P(L = Af = N = 0). Such a 
point P is called a point of flattening. Figure 12.13 depicts a surface 
with a point of flattening. 

Any point of a plane is a point of flattening. A point with coordi- 
nates (0, 0, 0) on a surface given by the equation s =27*-+ 24 may 
serve as an example of an isolated point of flattening. 


Vig. 12.43 Fig. 12.44 


Note that if all the points of a surface are points of flattening that 
surface is a plane. 

12.3.4. The curvature of a curve on a surface. Let @ be a regular 
surface given by a vector function r = r (u, v), let m be the normal 
unit vector to @, and Jet J be a regular curve on @ with a direction 
du:dv at a point P (1, v). 

Choose the length 2 as the parameter on Z so that r =r (u (), 
v (l)) =r (I) along L. In Section 12.2.6 we have established that 
r” (1) is directed along the principal normal nv, to Z at P and that 
the absolute value of that vector is equal to the curvature / of Lat P. 

Therefore 


r'n =k cos 9, (12.47) 
where mis the angle between the principal normal n, to a curve L 


and the normal n to a surface (Fig. 12.44). By the indirect differen- 
tiation rule 


Pr’ D=ryyt’*+2r, uu +r, veitryu’-r,v". 
Since n is orthogonal] to r, and r,, substituting the obtained expres- 
sion for r” (2) on the Jeft-hand side of (12.47) and taking into account 


* In this case the second form may he represented as the square of some linear 
form of differentials du and dr. 
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formulas (12.41) we get 
rn= (Tuul) wrt 2 (Tuptt) uv" > (Ppt) ye 
= Lu’? +2Mu'v' + Nv". (12.48) 


Since ul = St v= ie and dl?=F du2~2F dudv+Gdvu? on L, 


from (12.47) and (12.48) we get 


Ldu?+2M dudu+N dv? Il 
E du? +-2F dudv-—G dv? an i (12.49) 


kcos@= 
The right-hand side of (12.49) depends only on the ratio du : dv, 
i.e. only on the direction du: dv. Therefore, for all L on M passing 
through a point P in the given direction du: dv the expression for 
k cos p is equal to some constant hr: 


k cos p = k, = const. (12.50) 


In particular, if a curve LZ is the so-called normal section L,, of a sur- 
face M in the direction du : dv, i.e. the line of intersection of PD with 
the plane passing through the normal nm and the direction du : dv, 
then p = 0, cos mg = ft and therefore formula (12.50) becomes 


k = ky. 


Thus &, is the curvature of the normal section of a surface in the 
direction du : dv and can he calculated from the formula 

. Ldu?+-2iMdudv=-Ndv? _ Il ~ 

hn = Edu?--2Fdudv+tGdve I ° (12.91) 
It is also called the normal curvature of the curve L. 

Note that equation (12.50) expresses the content of the Afeusnier* 
theorem. 

12.3.5. Special curves on a surface. 

1°. Asymptotes. A direction du : dv on a regular surface @ at a 
point P is said to be asymptotic if the normal curvature in that 
direction is equal to zero. 

It follows from relation (12.51) that a direction du : dv is asympto- 
tic if and only if for that direction 


L du? + 2M du du + N dv? = 0. (12.52) 


Since the second form vanishes at hyperbolic points, parabolic 
points and points of flattening of a surface, it is only at these points 
that there are asymptotic directions: two asymptotic directions at 
a hyperbolic point, one asymptotic direction at a parabolic point, 
any direction at a point of flattening being asymptotic. 

We introduce the concept of asymptote. 


* Meusnier (1754-1799) is a French mathematician. 
fe 27-01684 
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An asymptote on a surface is a curve whose direction at every point 
Is asymptotic. 

If a regular surface consists of hyperbolic points, it is covered by 
two families of asymptotes, 

For instance, the two families of rectilinear generators of a one- 
sheeted hyperboloid are asymptotes. 

If there are two families of asymptotes on a surface, then they 
may in general be chosen to he coordinate curves uv and v. Jn this 
case along the curve u, for instance, the parameter v remains un- 
changed and therefore the second form has the form II = Z du? 
onu, Since in an asymptotic direction I] = 0 (see relation (12,52)), 
ZL = 0, Similarly it can bo shown that NV = 0. So, if the asymptotes 
of a surface are coordinate curves, then the second form is as follows: 


II = 2a du dv. 


2°. Principal directions. Lines of curvature. {t is clear from formula 
(12.91) that the normal curvature at a given point is a function of du 
and dv, more precisely, of the ratio du/dv, i.e. of the direction 
du:dv ata given point. 

Extreme values of the normal curvature at a given point are 
called principal curvatures, and the corresponding directions are 
principal directions. 

We show that there are always principal directions at a given 
point of a regular surface. 

Putting 


du dv 


a = 00S A eee = SING, 
/ due -! dee " du? fare 


we transform the expression (12,54) for &, into the form 


Leowta4-2M cosasing -}- Nsintia 
i costa--2F cosasina+Gsinta 


. I, = . 
Thus the normal curvature i, at a given point is a differentiable 
function of the independent variable @ given on a closed interval 
{(0, 2a] and taking on the same values for @ = 0 and @ = 2a. At 
some interior point @ of that interval therefore /:,, has a local extremum. 
Corresponding to that value of @ is a direction du : dv on the surface 
which is naturally a principal direction. Hf we start measuring angles 
@ from that principal direction, then, reasoning similarly, wo shal] 
see that an extremum of normal curvature is attained for at Jeast 
one more direction du: dv. 

50 there are at least two different principal directions at each point 
of a regular surface. 

We show a method of computing principal curvatures at a given 
point. 
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Assuming k, to be a function of du and dv we obtain from (12.54) 
-the following identity in du and dv: 


(L—k,E) du* +2 (M—k,F) du dv+(N —k,G) dv? = 0. 


Differentiating it with respect to du and to dv and considering that 
the derivative of the normal curvature for a principal direction is 
equal fo zero we obtain for du and dv corresponding to any principal 
direction the relations 


(L—k,E) du+(M —k,F) dv = a 
(M—k;F) du -- (N —Ik;G) dv = 0, 


where k; is the value of a principal curvature in the direction du: dv: 
Since there are principal directions at every point, the system (12.53) 
has nonzero solutions for du and dv. Consequently, the determinant of 
(12.53) must be equal to zero: 


L—k,.E M—kF 
M—k,F N—k,G|~ 


From equation (12.54) we can determine principal] curvatures /;, 
and the principal directions can then be found from relations (42.53). 

Equation (12.54) is a quadratic equation in k; whose real roots 
are principal curvatures. Therefore two cases are possible: 

4°. Equation (12.54) has two distinct roots, k, and ky. 

2°. The roots k; of (12.54) coincide. We consider these cases sepa- 
rately. ; 

1°. Equation (42.54) has two distinct roots: ky and ky, ky ~ ky. Twa 
different principal directions correspond to them. We show that 
if the directions of the coordinate curves u and v at a given point coincide 
with the principal directions, then at that point F =0 and M = 0. 
Note that the vanishing of F implies that the principal directions are 
orthogonal. 

So let the directions of the coordinate curves u and v at a given 
point coincide with the principal directions. This means that the 
directions du : 0, 0: du are principal directions and therefore rela- 
tions (12.53) yield 


L~—kE = 0, M —k,F = 0, 
M—kF=0, N—kG =O. 
Since k, = k., clearly M = 0, F = 0. Note that with that chsicte 


of coordinate curves the principal curvatures k, and k, can be femme 
from the relations 


N 
G* . 


(12.53) 


(12.54) 


ky = 
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2°, Equation (12.54) has two coincident roots: I, = kh. = kh. We 
show that in this case any direction at a given point isa principal diree- 
tion. If the coordinate curves at a given point are orthogonal, then at 
that point F =O and VM = 0. 

We have already noted that there are at least teco different prin- 
cipal directions at every point. In the present case to either of the 
principal directions corresponds the same value ¢/ of the principal 
curvature. But then the coefficients of the system (12.53) must vanish, 
i.e. 


L—-kE=0, A-kF =0, N —iG = 0. 


From these equations it follows that at a given point the coefficients 
of the second form are proportional to the coefficients of the first 
form: 


L=tkk, M=hkF, N = ke. 


Substituting these values for Z, 37, and N in formula (12.51) we see 
that at a given point the curvatures of normal sections in any direc- 
tion du : dv are the same and equal /&. Consequently, any direction 
du:dv at a given point is a principal direction. 

If the coordinate curves at a given point are orthogonal, then 
f= Q, and from the relation AJ —*AF = 0 it then follows that 
Al =), too. 

So we may draw the following conclusion: (here are orthogonal 
principal directions at each paint of a surface. If the directions of 
coordinate curves coincide with those principal directions, then at that 
point F = Q and M = 

We introduce the concept of line of curvature. 

A line of curvature on a surface is a curve whose direction at every 
point is principal. 

There are in general two different families of lines of curvature 
on any regular surface (we have pointed out above that there are 
two different principal directions at every point). 

Note that if we choose lines of curvature as the coordinate curves, 
then the first and the second form of a surface will he as follows: 


L= du +- G de-, 
Tl = Ldu? + N dv*, 


since #& = Q and JJ = 0. 

3°. Geodesics. AA geodesic (line) ou a surface is a curve the principal 
normal at each point of which coincides with the norma] to the sur- 
face. 

Any two points of a regular complete surface can be connected by 
a geodesic. If the points are sufficiently close, then the geodesic 
connecting (hem is the shortest curve, any other surface curve con- 
necting the points will be of greater length. 
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Note that the motion of a point on a surface without external 
influence takes place along a geodesic. 

12.3.6. The Euler formula. The mean and the Gaussian curvature 
of a Surface. The Gauss theorem. Let P be a fixed point of a regular 
surface ®. We shall assume that coordinate curves u and v are ortho- 
gonal at the given point and that the directions of the curves coincide 
with the principal directions, We have established in Section 12.3.5 
that under such a choice of coordinate curves at a given point 


F=0, M=0, L—hE=0, N—kG =O. 


Using these relations makes formula (12.54) for a normal curvature 
k, take the form 
k= hE du®?+-koG dv? 
n Edut-+Gdav 


If we set 
_ V Edu 
cos (= YV Edut?+Gdvi ’ (12.55) 
sing = Y Gdv 


VY E du2--G dv? ’ 


then obviously we obtain the following formula for normal cur- 
vature; 


k, =k,cos? p+, sin? . (12.56) 


Formula (12.56) is called the Euler formula. Using it a normal cur- 
vature k, in a direction du : dv can be calculated in terms of prin- 
cipal curvatures i, and kg. 

Obviously the Euler formula and formula (12.50) give complete 
information about the location of the curvatures of curves on a sur- 
ace. 

Remark 1. The angle m in the Euler formula whose value for a 
given direction du: dv can be found from formulas (12.05) is that 
angle which the direction du : dv makes with the direction of the 
coordinate curve wu. 

To see this calculate from formula (12.36) the cosine of the 
angle beetween the direction du:dv and the direction du:0 of 
the curve u. Setting in formula (12.36) 6u=du, 6v=0 yields 

VY E du 
; V E du? +G dv? 
cides with the expression for cos (p determined from the first of the 
formulas (42.05). 

Eixtensively used in the theory of surfaces are the concepts of 
mean and Gaussian curvature of a surface at a given point. 


for the desired cosine the expression which coin- 
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The mean curvature / of a surface is a half-sum (hy =- ka) 


of the principal curvatures. The Gaussian curvature K of a sur- 
face is the product It,k. of the principal curvatures. 

Turning to equation (12.54) for principal curvatures and using the 
properties of the roots of a quadratic equation we obtain the following 
formulas for 7 and A: 


G—2MFANE 


,_ LN—M? 9 
K = EGE (12.58) 


Remark 2, From the expression (12.58) for Gaussian curvature it 
follows that its sign coincides with that of the discriminant LN’ ~— .M°- 
of the second quadratic form (the discriminant EG — F* of the first 
quadratic form is always positive since the first form is positive 
definite). Therefore Gaussian curvature is positive at elliptical 
points, negative at hyperbolic points and equal to zero at parabolic 
points and points of flattening. 

The Gaussian curvature £ of a surface, on the face of it, can he 
found only if we know the first and second quadratic forms of the 
surface (see formula (12.58)). 

In fact, it may be expressed in terms of the coefficients of the first 
quadratic form alone and is therefore the object of the intrinsic 
geometry of a surface. This remarkable result was established hy 
Gauss* and is called the “famous Gauss theorem” in the mathematical 
literature. We prove this theorem. 

Theorem (Gauss). TheGausstancurvature K of asurface can be expres- 
sed in terms of the coefficients of the first quadratic form of the surface 
and their derivatives. 

Proof, Turning to formula (12.58) for Gaussian curvature A 
and using the expressions (12.42) for the coefficients of the second 
quadratic form it is casy to sce that to prove the theorem it suffices 
to express the following expression in terms of the coefficients of tha 
first quadratic form and their derivatives: 


A = (Puulule) Cnelul'e) ~ Puch ule)” 
This expression is casy to transform to the form** 


a 
Puuler Pur Puulu “uulo 0 Puclu Puls 
As Fulop E Pr ~ Puluo iE iD (42.59) 
reg I G rivuyp &F G 


* Carl Friedrich Gauss (1777-1855) is a prominent{Genman mathematician. 
** The following identity is used in the transformation: 


QyAs a,b. QC. 
bia. bb. byes 
€;9, Cb, eye, 


(a3b,c,) (a,b,€,) = 
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Differentiating with respect to u and v the expression 
r= E, rury =F, r= 


yields 


4 4 4 
Puulu = a fu: Tuolu =z Ao: Poole = > Gy, 


1 4 4 
Tul yp = 5 Cus Puuly =F u— > Eo: Pylu= Py —z Gy. 


Differentiating the expression for r,,r, with respect to v and the 
expression r, p7”, with respect to u and subtracting the results obtained 
we get 


4 4 
Yuul pp — Tw = ~> Guut Pur > Evv- 


Substituting the expression obtained and the expressions for the 
scalar products of the derivatives on the right-hand side of (12.59) 
we see that the theorem is true. 

In conclusion we give the expression for Gaussian curvature K 
in terms of the coefficients of the first quadratic form and their 
derivatives: 


(— 5 Guu+ Fury Evo) Eu( Fu Eo) 


4 4 
K=cg—pr| (Poa Gu) BOF |— 
4 
4G, P G 
4 4 
0 +z, +6, 
4 4 
—giger|ze EF 
5G. F G. 


APPENDIX 


EVALUATION OF FUNCTIONS FROM APPROXIMATE 
VALUES OF FOURIER COEFFICIENTS 


A.1. The problem of summing a Fourier trigonometric series with 
approximate values of Fouricr cocfficients. Suppose first that a func- 
tion f (z) satishes conditions ensuring the uniform convergence of 
its Fourier trigonometric series 


feo] 

<2 +- S' (a, cos kz +b, sin kz) (A.1) 

eed 
on the whole interval [—a, x]. Suppose further that instead of the 
exact value of the Fourier trigonometric coefficients @;, and by of 
that function we know only their approximate values a), and bp. 
It is this case that is very common in applications. 

We shall assume that errors in assigning approximate values to 
Fourier trigonometric coefficients are small in the sense of the 
norm /**. This means that 


~ Lee) 
(Qg—ap)* ~*~. \o 2 J, \2 9 ‘ 
a >) (dy — ay)? + (by, — 5,2? <&, (4.2) 
k=1 
where 6 is a sufficiently small positive number which we shall call 
the error in assigning Fourier coefficients. 
The problem important for applications naturally arises: given 


approximate values for Fourier coefficients, @, and b) regenerate at 
a given point z, the function f/(z) with an error « (5) tending to zero 
as § +0 

We show that direct summation of a Fourier series with approxim- 
ate Fourier coefficients 


~, 


te 
a ~ 7 . ‘ 
++ S' (ay cos kx +b, sin kz) (A.3) 
ho] 
cannot in general regenerate f (z) at a given point x to any precision. 


We fix an arbitrarily small error 6>0 and set c-V 9 — 


heal 
Suppose that errors in assigning Fourier coefficients have the Jol- 


* For the definition of a space and the norm of its elements see Section 11.4.1. 


Appendiz 425 


lowing particular form: 


~ ~ 5 . 
dg— 4) = 0, An — Gx = Dy — On = He with k=, 2, ere 


For Fourier coefficients with such errors relation (A.2) with the sign 
of exact equality will obviously be true. At the same time, replacing 
the exact Fourier series (A.1) by a Fourier series with approximate 
coefficients (A.3) causes an error equal to the sum of the series 


SY (@,—a,) cos kr-+-(b, —b,) sin Az 


(however small the error 6 > 0 fixed). 

Thus however rapidly the exact Fourier trigonometric series (.A.1) 
may converge to f (z) and however small 6 in relation (A.2) giving 
the deviation of the approximate Fourier coefficients from the exact 
ones may be. direct summation of a Fourier series with approximate 
Fourier coefficients (A.5) cannot regenerate 7 (t) at a given point of 
{—x, a] to any precision. 

In fact we have proved that however small the number 6 > 0 
characterizing the deviation from each other, in the sense of (:A.2), 


of two sets of Fourier coefficients, {a,, b.} and {a;, &,}, correspond- 
ing to the two sets. the direct sums of Fourier trigonometric series 
(A.4) and (4.3) may differ arbitrarily from each other. 

Such problems for which an arbitrarily smal] deviation in initial 
data (in the case above these are a set of Fourier coefficients) may 
cause an arbitrarily large deviation of solutions corresponding to 
those initial data (in the case considered above, by a solution we 
mean the direct sum of a Fourier trigonometric series) are common 
in mathematics and in applications and are called incorrectly set 
problems. 

In other words, the direct summation problem we have considered 
is an incorrectly set problem. 

A general method of solving a wide class of incorrectly set prob- 
lems has been developed by the Soviet mathematician A.N. Ticho- 
nofi and is called the regularization meihod*. — 

Here we shall discuss it only as applied to the summation problem 
considered. 


* A.N. Tichonoff was awarded a 1966 Lenin prize for a series of works on 
solving incorrectly set problems. 


23-168. 
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A.2. The regularization method for the problem of summing a Fourier 
trigonometric series. As applied to the problem of summing a Fourier 
trigonometric serics with approximate Fourier coefficients the 
regularization method Jeads to an algorithm which regards as an 
approximate value of f (z) not the sums of the series (A.3) but the 
sums of the series 


7 ooo! ~ 4 
> + 2 (a; cos kxr-+- b, sin kr) Tre (A.4) 


obtained by multiplying the Ath term of the series (A.38) by a 


“recularizing” factor » Where @ isa paramoter of the same 


4--h3a 
order of smallness as 6 in relation (A.2) giving the deviation 
of Fourior coefficients. 

To substantiate the above algorithm wo shall prove the following 
main theorem. 

Theorem (AN. Tichonoff’s theorem). Let a function f (x) belong 
to a class L* [—x, x] and be continuous at a given point x of {—x, x). 
Then for every 6 > 0 and for a having the same order of smallness as 
§ the sum of the series (A.4) with coefficients ay and b), satisfying relation 
(A.2) coincides at the point x with f (x) with error e (5) tending to ccro 
as § —0*. 

Proof. We may assume without loss of generality that a = 6 
(for the case a = C (6)-6, where O<C, <C (6) <Cy,, can he 
considered quite similarly). It suffices to prove that, given any ¢ > 0, 
wo can find 5, (e) > Osuch that at a given point z for’every positive 6 
satisfying the condition 6 < 6, 


foe Ace « , { 

+ 2s {a,, coshz-+-b, sin ha) eaayaR — f (2) <Ee (A.5) 
Fix an arbitrary ¢ > 0. We shal) first show that for the fixed e 

there is a number 46, (e) > O such that for every positive 6 satisfying 

the condition 5 = 6, (e) 


Se > [(a,—a,) cos hex + (by, — b;,) sin hay] 
hex4 


4 & 
x pos |< (A.6) 


To establish (A.6) it suffices to show that the sum on the left’of (A.6) 
tends to zero as 6 —+-+0. 


° This theorem is a special case of the much more general statement proved 
by A.N. Tichonoff. 
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Splitting the sum on the left of (A.6) into two sums the first of 
which contains terms with */ satisfying the condition * < 41/5 and 
the second all the other terms, and applying to each of the two sums 
the Cauchy-Buniakowski inequality we have* 


q—4y WA at ~ 1855 
“yet [(a;, — a) cos kx-+ (6, —b;) sin kz} TOES I< 

ee 
</ { Aor toy + py (4, —ay)°-+ Ox—bx)"1} O (=) + 

Roe 

{- ~ ° 9 1 - 

+ / » [(a, —a,)°-+ (6; — 8, )"]- » 62° (A.1) 
| oe > 


Taking into account relation (A.2) and considering that 


YS w= 08) 


R216 


(for instance, by virtue of the integral Cauchy-Maclaurin test, see 
inequality (13.38) in Chapter 13 of [1])} we obtain a quantity 
0 (V8) + O (89°) on the right of (A.7). 

This inequality (A.6) may be taken for granted and*®tofestablish 
inequality (A.5) it suffices for us to prove that for theyfixed e« > 0 
there is a number 6, (¢) > 0 such that for every positive 5 satisfying 
the condition 6 < 4, (s) 


c ; 4 3 
B+ 2 (a, cos kx-4-b, sin het) peg — ft (2)| <= g. (A.8) 
k= 


Since as stated 7 (z) is continuous at a given point z, for the fixed 
& > 0 we canffix 7 > 0 such that for every value of y satisfying the 
condition |y—z][<y 


Fw—fal<z. (A.9) 


Now set y = 1/ /6 and consider for the fixed x and theffixed yn > 0 
a function v. (y) defined on a half-open interval z— n<?y < 


* We also take into account the fact that Tee <!: Tras < TF: 


288 
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<zr—y+2a by the equation* 


A7 Je-viz-1! whe — 4 
vy w=| 2 e When z—yn<uvr+y, (A10} 


when z24-qnxSySz—y+ 2a 


and periodically with period 27 extended to the whole infinite line 
—oo Tyco. 

Compute the Fourier trigonometric coefficients A; and B, of 
vy (yu). 

From equation (A.10) and from the periodicity condition of vy (y) 
with period 2x we get** 


‘ x1] XT] 
U 
A, => { v, (y) cos ky dy = ( e-Tle-Yl cosky dy = 
x1 x-) 
Ty bt} 
at [ e-¥iMeosk(t-+2) dt = Leos ke | e~? "cos kt dt — 
ny 7 =n 


” nN 
_ + sin dz | e-vltl sin kt di = y cos kx { e~¥b cos Ist dt. 
: 0 


Further, since 


7 “ 
° ~ 1" (—y cos kt 4-k sin kt) Vf=n ’ , 
| ew COS ict ¢=[—_ eee | = - = a ¢-'N-o) 
. Bee .f- 49% t= k _y 
0 
where 
—ycoskn—s:sink 
Go), = 7 £08 A ST (A.41) 


KF y? 
considering that 6 = 1/y* yields the following expression for thie 
Fourier coefficient Aj,: 


_ coskz 
A, = {-1-k26 


+e-"coskzr-y-o,. (A.12) 


__ 
* We may assume without loss of generality that yn < 1. 
#* Wo take into account the fact that all integrals of a periodic function over 


intervals of length, equal to its period, coincide; we make a change of 
n 


variable y == ¢-+ 2 and take into consideration that \ en TIE coe kt dt = 
=n 
n 1 
a) | e~*eos kt dt, | eT Nsin py de = 0, 
0 


“TN 
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Quite similarly it canbe established that 


k . 
B, = aah +e" sin kxr-y-Op. (A.413) 


Since as stated f(y) belongs to L*[—a, x] and since v,, (y) 
belongs to the same class for any 56=—~> 0, we have the gen- 
eralized Parseval formula (see equation (11.28)) 

n 

= | ve (y) f(y) ay =A 43 (Ag+ dy + By by). (A.14) 
— IT 

From relations (A.42), (A.13), and (A.14) it follows that to prove 


inequality (A.8) it suffices to establish that for every sufficiently 
smal] positive 6 


a 
1 
le | vs (uf) dy—fl2)| <5 (A.15) 
— I 
|. evo, + e7 Nv > GO; (a; cos kz + by sin kz) |< “- . (A.16) 
k=1 


We extend f (y) periodically with period 2n to the whole infinite 
line. 

To prove inequality (A.15) notice that by the 2n-periodicity of 
vx (y) and f(x) and by equation (A.10) 


ar X — T+ 25 
=\uwMivda=> | vuw@iwMdy= 
x+n ~ xX*N 
=f e-vle- ty) dy= Fla) |e? R-ul ay+ 


x—T) x~T} 


’ 
al 


+E | WP @lerve-wl dy. (A.17) 


x— 7 
Considering that* 


x+7 n nN 
ms \ envie-vl dy = | envi dt=y | e-vi dt=41—e-m 
x-) -T 0 


* In computing this integral we make a change of variable y = i + z. 


434) fundamentals of Mathematical Anclysts 


and that for any yv in [z°— y, z + n] inequality (A.9) is true we 
get with the aid of relation (A.17) 


AC nw fwar—t(a{< 


e 


Serv] f (x)| +L (Ler) Se-M] f (x) HE. 


Since e-“" | f (x) |< c/4 for every fixed point z and for the fixed 
numbers & >O and y > 0, with all 6 = 1/y* sufficiently small, 
relation (.15) is proved. 

It remains to prove inequality (A.16). From (A.11) it is obvious 
that for G,, at all & = 1, 2, ..., we have the estimate 


lo, |< Qk. (A.48) 


For Gq in (A.11), with all 6 = 14/)* sufficiently small, we obtain the 
estimate 


| sy, I< iy << 4, (A.19) 


Applying to the sum on the left of (A.16) the Cauchy-Buniakowski 
inequality and using the estimates (A.18) and (A.19) we get* 


o 
| 3 e~*MyGy-+e-My >’ a, (ay, cos kx + by sin kz l\< 
k=l 


<2e- My] Sep » (ak +04) |". [4 4-4 > “|. (A.20) 
heaf h=1 


The two sums in square brackets on the right of (A.20) are bounded 
by a constant (independent of 6). The boundedness of the first sum 
follows immediately from the Bessel inequality and that of the 
second has been proved in Chapter 13 of {4). 


Since for any fixed n>0O, lim e~ vn, y= lime oA 0, the 


right-hand side of (A.20) for any ‘fixed e>0 is less than e/4, 
with all pusitive 6 sufficiently smaJ}. Thus the theorem js 
proyed. 

A.3. Concluding remarks on the importance of the regularization 
method. A. N. Tichonoff's regularization method is of groat scientific 
significance. 

Suppose that we use some instrument to measure the frequency 
responsesfof a physical process of interest. Because of the instru- 
ment's imperfection our measurements are somewhat in error. 


* We majorize by unity the absolute values of cos kz and sin kr. 
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rylhe problem naturally arises: if we are to clarify as much as pos- 
sible our idea of the process, should we improve indefinitely the 
instrument's precision or should we instead develop such mathe- 
matical methods of data processing which would allow us, with the 
measurement accuracy available, to obtain maximum information 
about the process. 

The regularization method shows a way to such a mathematical 
treatment of the observed data (i.e. of Fourier coefficients) which 
gives us information about the physical phenomenon under study 
di.e. about the desired function f (z)) with an error corresponding to 
the error in the observed data. 
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Gibbs, 146 Riesz, 244 
Green, 170 Schwarz, 133 
Haar, 303 Stirling, 284 
Kronecker, 144 Stokes, 182 
Lamé, 164 Stone, 53 


Laplace, 159 


SUBJECT INDEX 
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